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<210> 1 

<211> 849 

<212> DNA 

<213> Caenorhabditis elegans 
<220> 

<221> CDS 

<222> (1)..(849) 

<223> Acyl-CoA: lysophospholipid acyltransf erase 



<400> 1 

atg gag aac ttc tgg teg ate gtc gtg ttt ttt eta etc tea att etc 48 
Met Glu Asn Phe Trp Ser lie Val Val Phe Phe Leu Leu Ser lie Leu 
15 10 15 

ttc att tta tat aac ata teg aca gta tgc cac tac tat atg egg att 96 
Phe lie Leu Tyr Asn lie Ser Thr Val Cys His Tyr Tyr Met Arg lie 
20 ■ 25 30 

teg ttt tat tac ttc aca att tta ttg cat gga atg gaa gtt tgt gtt 144 
Ser Phe Tyr Tyr Phe Thr lie Leu Leu His Gly Met Glu Val Cys Val 
35 40 45 

aca atg ate cct tct tgg eta aat ggg aag ggt get gat tac gtg ttt 192 
Thr Met lie Pro Ser Trp Leu Asn Gly Lys Gly Ala Asp Tyr Val Phe 
50 55 60 

cac teg ttt ttc tat tgg tgt aaa tgg act ggt gtt cat aca aca gtc 240 
His Ser Phe Phe Tyr Trp Cys Lys Trp Thr Gly Val His Thr Thr Val 
65 70 75 80 



tat gga tat gaa aaa 
Tyr Gly Tyr Glu Lys 
85 



aca caa gtt gaa ggt ccg get gta 
Thr Gin Val Glu Gly Pro Ala Val 
90 



gtt att tgt 
Val He Cys 
95 



288 



2 



aat cat cag agt tct etc gac att eta teg atg gca tea ate tgg ccg 336 
Asn His Gin Ser Ser Leu Asp lie Leu Ser Met Ala Ser He Trp Pro 
100 105 110 

aag aat tgt gtt gta atg atg aaa cga att ctt gec tat gtt cca ttc 384 
Lys Asn Cys Val Val Met Met Lys Arg He Leu Ala Tyr Val Pro Phe 
115 120 125 

ttc aat etc gga gec tac ttt tec aac aca ate ttc ate gat cga tat 432 
Phe Asn Leu Gly Ala Tyr Phe Ser Asn Thr He Phe He Asp Arg Tyr 
130 135 140 

aac cgt gaa cgt gcg atg get tea gtt gat tat tgt gca tct gaa atg 480 
Asn Arg Glu Arg Ala Met Ala Ser Val Asp Tyr Cys Ala Ser Glu Met 
145 150 155 160 

aag aac aga aat ctt aaa ctt tgg gta ttt ccg gaa gga aca aga aat 52 8 

Lys Asn Arg Asn Leu Lys Leu Trp Val Phe Pro Glu Gly Thr Arg Asn 
165 170 175 

cgt gaa gga ggg ttc att cca ttc aag aaa gga gca ttc aat att gca 576 
Arg Glu Gly Gly Phe He Pro Phe Lys Lys Gly Ala Phe Asn lie Ala 
180 185 190 

gtt cgt gcg cag att ccc att att cca gtt gta ttc tea gac tat egg 624 
Val Arg Ala Gin He Pro He He Pro Val Val' Phe Ser Asp Tyr Arg 
195 200 205 

gat ttc tac tea aag cca ggc cga tat ttc aag aat gat gga gaa gtt 672 
Asp Phe Tyr Ser Lys Pro Gly Arg Tyr Phe Lys Asn Asp Gly Glu Val 
210 215 220 

gtt att cga gtt ctg gat gcg att cca aca aaa ggg etc act ctt gat 720 
Val He Arg Val Leu Asp Ala He Pro Thr Lys Gly Leu Thr Leu Asp 
225 230 235 240 

gac gtc age gag ttg tct gat atg tgt egg gac gtt atg ttg gca gee 768 
Asp Val Ser Glu Leu Ser Asp Met Cys Arg Asp Val Met Leu Ala Ala 
245 250 255 

tat aag gaa gtt act eta gaa get cag caa cga aat gcg aca egg cgt 816 
Tyr Lys Glu Val Thr Leu Glu Ala Gin Gin Arg Asn Ala Thr Arg Arg 
260 265 270 

gga gaa aca aaa gac ggg aag aaa tct gag taa 849 
Gly Glu Thr Lys Asp Gly Lys Lys Ser Glu 
275 280 



<210> 2 

<211> 282 

<212> PRT 

<213> Caenorhabditis elegans 



<400> 2 



3 



Met Glu Asn Phe Trp Ser lie Val Val Phe Phe Leu Leu Ser lie Leu 
1 .5 10 15 



Phe lie Leu Tyr Asn lie Ser Thr Val Cys His Tyr Tyr Met Arg lie 
20 25 30 



Ser Phe Tyr Tyr Phe Thr lie Leu Leu His Gly Met Glu Val Cys Val 
35 40 45 



Thr Met lie Pro Ser Trp Leu Asn Gly Lys Gly Ala Asp Tyr Val Phe 
50 55 60 



His Ser Phe Phe Tyr Trp Cys Lys Trp Thr Gly Val His Thr Thr Val 
65 70 75 80 



Tyr Gly Tyr Glu Lys Thr Gin Val Glu Gly Pro Ala Val Val He Cys 
85 90 95 



Asn His Gin Ser Ser Leu Asp He Leu Ser Met Ala Ser He Trp Pro 
100 105 110 



Lys Asn Cys Val Val Met Met Lys Arg He Leu Ala Tyr Val Pro Phe 
115 120 125 



Phe Asn Leu Gly Ala Tyr Phe Ser Asn Thr He Phe He Asp Arg Tyr 
130 135 140 



Asn Arg Glu Arg Ala Met Ala Ser Val Asp Tyr Cys Ala Ser Glu Met 
145 150 155 160 



Lys. Asn Arg Asn Leu Lys Leu Trp Val Phe Pro Glu Gly Thr Arg Asn 
165 170 175 



Arg Glu Gly Gly Phe He Pro Phe Lys Lys Gly Ala Phe Asn He Ala 
180 185 190 



Val Arg Ala Gin He Pro He He Pro Val Val Phe Ser Asp Tyr Arg 
195 200 205 



Asp Phe Tyr Ser Lys Pro Gly Arg Tyr Phe Lys Asn Asp Gly Glu Val 
210 215 220 



Val lie Arg Val Leu Asp Ala lie Pro Thr Lys Gly Leu Thr Leu Asp 
225 230 235 240 



Asp Val Ser Glu Leu Ser Asp Met Cys Arg Asp Val Met Leu Ala Ala 
245 250 255 



Tyr Lys Glu Val Thr Leu Glu Ala Gin Gin Arg Asn Ala Thr Arg Arg 
260 265 270 



Gly Glu Thr Lys Asp Gly Lys Lys Ser Glu 
275 280 



<210> 3 

<211> 849 

<212> DNA 

<213> Caenorhabditis elegans 
<220> 

<221> CDS 

<222> (1) ... (849) 

<223> Acyl-CoA: lysophospholipid acyltransf erase 



<400> 3 

atg gag aac ttc tgg teg ate gtc gtg ttt ttt eta etc tea att etc 

Met Glu Asn Phe Trp Ser lie Val Val Phe Phe Leu Leu Ser lie Leu 
1 5 10 15 

ttc att tta tat aac ata teg aca gta tgc cac tac tat atg egg att 
Phe lie Leu Tyr Asn lie Ser Thr Val Cys His Tyr Tyr Met Arg lie 
20 25 30 

teg ttt tat tac ttc aca att tta ttg cat gga atg gaa gtt tgt gtt 
Ser Phe Tyr Tyr Phe Thr lie Leu Leu His Gly Met Glu Val Cys Val 
35 40 45 

aca atg ate cct tct tgg cta.aat ggg aag ggt get gat tac gtg ttt 
Thr Met lie Pro Ser Trp Leu Asn Gly Lys Gly Ala Asp Tyr Val Phe 
50 55 60 

cac teg ttt ttc tat tgg tgt aaa tgg act ggt gtt cat aca aca gtc 
His Ser Phe Phe Tyr Trp Cys Lys Trp Thr Gly Val His Thr Thr Val 
65 70 75 80 

tat gga tat gaa aaa aca caa gtt gaa ggt ccg get gta gtt att tgt 
Tyr Gly Tyr Glu Lys Thr Gin Val Glu Gly Pro Ala Val Val He Cys 
85 90 95 

aat cat cag agt tct etc gac att eta teg atg gca tea ate tgg ccg 
Asn His Gin Ser Ser Leu Asp He Leu Ser Met Ala Ser He Trp Pro 
100 105 110 



aag aat tgt gtt gta atg atg aaa cga att ctt gec tat gtt cca ttc 



5 



Lys Asn Cys Val Val Met Met Lys Arg lie Leu Ala Tyr Val Pro Phe 
115 120 125 

ttc aat etc gga gec tac ttt tec aac aca ate ttc ate gat cga tat 432 
Phe Asn Leu Gly Ala Tyr Phe Ser Asn Thr lie Phe lie Asp Arg Tyr 
130 135 140 

aac cgt gaa cgt gcg atg get tea gtt gat tat tgt gca tct gaa atg 480 
Asn Arg Glu Arg Ala Met Ala Ser Val Asp Tyr Cys Ala Ser Glu Met 
145 150 155 160 

aag aac aga aat ctt aaa ctt tgg gta tct ccg gaa gga aca aga aat 528 
Lys Asn Arg Asn Leu Lys Leu Trp Val Ser Pro Glu Gly Thr Arg Asn 
165 170 175 

cgt gaa gga ggg ttc att cca ttc aag aaa gga gca ttc aat att gca 576 
Arg Glu Gly Gly Phe lie Pro Phe Lys Lys Gly Ala Phe Asn lie Ala 
180 185 190 

gtt cgt gcg cag att ccc att att cca gtt gta ttc tea gac tat egg 624 
Val Arg Ala Gin He Pro He He Pro Val Val Phe Ser Asp Tyr Arg 
195 200 205 

gat ttc tac tea aag cca ggc cga tat ttc aag aat gat gga gaa gtt 672 
Asp Phe Tyr Ser Lys Pro Gly Arg Tyr Phe Lys Asn Asp Gly Glu Val 
210 215 220 

gtt att cga gtt ctg gat gcg att cca aca aaa ggg etc act ctt gat 720 
Val He Arg Val Leu Asp Ala lie' Pro Thr Lys Gly Leu Thr Leu Asp 
225 230 235 240 

gac gtc age gag ttg tct gat atg tgt egg gac gtt atg ttg gca gee 768 
Asp Val Ser Glu Leu Ser Asp Met Cys Arg Asp Val Met Leu Ala Ala 
245 250 255 

tat aag gaa gtt act eta gaa get cag caa cga aat gcg aca egg cgt 816 
Tyr Lys Glu Val Thr Leu Glu Ala Gin Gin Arg Asn Ala Thr Arg Arg 
260 265 270 

gga gaa aca aaa gac ggg aag aaa tct gag taa 849 
Gly Glu Thr Lys Asp Gly Lys Lys Ser Glu 
275 280 



<210> 4 

<211> 282 

<212> PRT 

<213> Caenorhabditis elegans 

<400> 4 

Met Glu Asn Phe Trp Ser He Val Val Phe Phe Leu Leu Ser He Leu 
1 5 10 15 



Phe He Leu Tyr Asn He Ser Thr Val Cys His Tyr Tyr Met Arg He 
20 25 30 
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Ser Phe Tyr Tyr Phe Thr lie Leu Leu His Gly Met Glu Val Cys Val 
35 ,40 45 



Thr Met lie Pro Ser Trp Leu Asn Gly Lys Gly Ala Asp Tyr Val Phe 
50 55 60 



His Ser Phe Phe Tyr Trp Cys Lys Trp Thr Gly Val His Thr Thr Val 
65 70 75 80 



Tyr Gly Tyr Glu Lys Thr Gin Val Glu Gly Pro Ala Val Val He Cys 
85 90 95 



Asn His Gin Ser Ser Leu Asp He Leu Ser Met Ala Ser He Trp Pro 
100 105 110 



Lys Asn Cys Val Val Met Met Lys Arg He Leu Ala Tyr Val Pro Phe 
115- 120 125 



Phe Asn Leu Gly Ala Tyr Phe Ser Asn Thr He 1 Phe He Asp Arg Tyr 
130 135 140 



Asn Arg Glu Arg Ala Met Ala Ser Val Asp Tyr Cys Ala Ser Glu Met 
145 150 155 160 



Lys Asn Arg Asn Leu Lys Leu Trp Val Ser Pro Glu Gly Thr Arg Asn 
165 170 175 



Arg Glu Gly Gly Phe He Pro Phe Lys Lys Gly Ala Phe Asn He Ala 
180 185 190 



Val Arg Ala Gin He Pro He He Pro Val Val Phe Ser Asp Tyr Arg 
195 200 205 



Asp Phe Tyr Ser Lys Pro Gly Arg Tyr Phe Lys Asn Asp Gly Glu Val 
210 215 220 



Val He Arg Val Leu Asp Ala He Pro Thr Lys Gly Leu Thr Leu Asp 
225 230 * 235 240 



Asp Val Ser Glu Leu Ser Asp Met Cys Arg Asp Val Met Leu Ala Ala 
245 250 255 
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Tyr Lys Glu Val Thr Leu Glu Ala Gin Gin Arg Asn Ala Thr Arg Arg 
260 265 270 



Gly Glu Thr Lys Asp Gly Lys Lys Ser Glu 
275 280 



<210> 5 

<211> 849 

<212> DNA 

<213> Caenorhabditis elegans 
<220> 

<221> CDS 

<222> (1) . . (849) 

<223> Acyl-CoA: lysophospholipid acyltransf erase 



<400> 5 

atg gag aac ttc tgg teg ate gtc gtg ttt ttt eta etc tea att etc 48 
Met Glu Asn Phe Trp Ser lie Val Val Phe Phe Leu Leu Ser lie Leu 
1 5 10 15 

ttc att tta tat aac ata teg aca gta tgc cac tac tat gtg egg att 96 
Phe lie Leu Tyr Asn lie Ser Thr Val Cys His Tyr Tyr Val Arg lie 
20 25 30 

teg ttt tat tac ttc aca att tta ttg cat gga atg gaa gtt tgt gtt 144 
Ser Phe Tyr Tyr Phe Thr lie Leu Leu His Gly Met Glu Val Cys Val 
35 40 45 

aca atg ate cct tct tgg eta aat ggg aag ggt get gat tac gtg ttt 192 
Thr Met lie Pro Ser Trp Leu Asn Gly Lys Gly Ala Asp Tyr Val Phe 
50 55 60 

cac teg ttt ttc tat tgg tgt aaa tgg act ggt gtt cat aca aca gtc 240 
His Ser Phe Phe Tyr Trp Cys Lys Trp Thr Gly Val His Thr Thr Val 
65 70 75 80 

tat gga tat gaa aaa aca caa gtt gaa ggt ccg get gta gtt att tgt 288 
Tyr Gly Tyr Glu Lys Thr Gin Val Glu Gly Pro Ala Val Val lie Cys 
85 90 95 

aat cat cag agt tct etc gac att eta teg atg gca tea ate tgg ccg 336 
Asn His Gin Ser Ser Leu Asp lie Leu Ser Met Ala Ser lie Trp Pro 
100 105 110 

aag aat tgt gtt gta atg atg aaa cga att ctt gee tat gtt cca ttc 384' 
Lys Asn Cys Val Val Met Met Lys Arg lie Leu Ala Tyr Val Pro Phe 
115 120 125 



ttc aat etc gga gee tac ttt tec aac aca ate ttc ate gat cga tat 
Phe Asn Leu Gly Ala Tyr Phe Ser Asn Thr lie Phe lie Asp Arg Tyr 
130 135 140 



432 



8 



aac cgt gaa cgt gcg atg get tea gtt gat tat tgt gca tct gaa atg 480 
Asn Arg Glu Arg Ala Met Ala Ser Val Asp Tyr Cys Ala Ser Glu Met 
145 150 155 . 160 

aag aac aga aat ctt aaa ctt tgg gta ttt ccg gaa gga aca aga aat 52 8 

Lys Asn Arg Asn Leu Lys Leu Trp Val Phe Pro Glu Gly Thr Arg Asn 
165 170 175 

cgt gaa gga ggg ttc att cca ttc aag aaa gga gca ttc aat att gca 576 
Arg Glu Gly Gly Phe lie Pro Phe Lys Lys Gly Ala Phe Asn lie Ala 
180 185 190 

gtt cgt gcg cag att ccc att att cca gtt gta ttc tea gac tat egg 624 
Val Arg Ala Gin lie Pro lie lie Pro Val Val Phe Ser Asp Tyr Arg 
195 200 205 

gat ttc tac tea aag cca ggc cga tat ttc aag aat gat gga gaa gtt 672 
Asp Phe Tyr Ser Lys Pro Gly Arg Tyr Phe Lys Asn Asp Gly Glu Val 
210 215 220 

gtt att cga gtt ctg gat gcg att cca aca aaa ggg etc act ctt gat 720 
Val lie Arg Val Leu Asp Ala lie Pro Thr Lys Gly Leu Thr Leu Asp 
225 230 235 240 

gac gtc age gag ttg tct gat atg tgt egg gac gtt atg ttg gca gee 768 
Asp Val Ser Glu Leu Ser Asp Met Cys Arg Asp Val Met Leu Ala Ala 
245 250 • 255 

tat aag gaa gtt act eta gaa get cag caa cga aat gcg aca egg cgt 816 
Tyr Lys Glu Val Thr Leu Glu Ala Gin Gin Arg Asn Ala Thr Arg Arg 
260 265 270 

gga gaa aca aaa gac ggg aag aaa tct gag taa 849 
Gly Glu Thr Lys Asp Gly Lys Lys Ser Glu 
275 280 



<210> 6 
<211> 282 
<212> PRT 

<213> Caenorhabditis elegans 
<400> 6 

Met Glu Asn Phe Trp Ser lie Val Val Phe Phe Leu Leu Ser lie Leu 
15 10 15 



Phe lie Leu Tyr Asn lie Ser Thr Val Cys His Tyr Tyr Val Arg lie 
20 25 30 



Ser Phe Tyr Tyr Phe Thr lie Leu Leu His Gly Met Glu Val Cys Val 
35 40 45 



Thr Met lie Pro Ser Trp Leu Asn Gly Lys Gly Ala Asp Tyr Val Phe 
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50 



55 



60 



His Ser Phe Phe Tyr Trp Cys Lys Trp Thr Gly Val His Thr Thr Val 
65 70 75 80 



Tyr Gly Tyr Glu Lys Thr Gin Val Glu Gly Pro Ala Val Val lie Cys 
85 90 95 



Asn His Gin Ser Ser. Leu Asp lie Leu Ser Met Ala Ser lie Trp Pro 
100 105 110 



Lys Asn Cys Val Val Met Met Lys Arg lie Leu Ala Tyr Val Pro Phe 
115 120 125 



Phe Asn Leu Gly Ala Tyr Phe Ser Asn Thr lie Phe lie Asp Arg Tyr 
130 135 140 



Asn Arg Glu Arg Ala Met Ala Ser Val Asp Tyr Cys Ala Ser Glu Met 
145 150 155 160 



Lys Asn Arg Asn Leu Lys Leu Trp Val Phe Pro Glu Gly Thr Arg Asn 
165 170 175 



Arg Glu Gly Gly Phe lie Pro Phe Lys Lys Gly Ala Phe Asn lie Ala 
180 185 190 



Val Arg Ala Gin He Pro He He Pro Val Val Phe Ser Asp Tyr Arg 
195 200 205 



Asp Phe Tyr Ser Lys Pro Gly Arg Tyr Phe Lys Asn Asp Gly Glu Val 
210 215 220 



Val He Arg Val Leu Asp Ala He Pro Thr Lys Gly Leu Thr Leu Asp 
225 230 235 240 



Asp Val Ser Glu Leu Ser Asp Met Cys Arg Asp Val Met Leu Ala Ala 
245 250 255 



Tyr Lys Glu Val Thr Leu Glu Ala Gin Gin Arg Asn Ala Thr Arg Arg 
260 . 265 270 



Gly Glu Thr Lys Asp Gly Lys Lys Ser Glu 
275 280 



10 



<210> 7 

<211> 849 

<212> DNA 

<213> Caenorhabditis elegans 
<220> 

<221> CDS 

<222> (1)..(849) 

<223> Acyl-CoA: lysophospholipid acyltransf erase 



<400> 7 

atg gag aac ttc tgg teg ate gtc gtg ttt ttt eta etc tea att etc 
Met Glu Asn Phe Trp Ser lie Val Val Phe Phe Leu Leu Ser lie Leu 
15 10 15 

ttc att tta tat aac ata teg aca gta tgc cac tac tat atg egg att 
Phe lie Leu Tyr Asn lie Ser Thr Val Cys His Tyr Tyr Met Arg lie 
20 25 30 

teg ttt tat tac ttc aca att tta ttg cat gga atg gaa gtt tgt gtt 
Ser Phe Tyr Tyr Phe Thr lie Leu Leu His Gly Met Glu Val Cys Val 
35 40 45 

aca atg ate cct tct tgg eta aat ggg aag ggt' get gat tac gtg ttt 
Thr Met lie Pro Ser Trp Leu Asn Gly Lys Gly Ala Asp Tyr Val Phe 
50 55 60 

cac teg ttt ttc tat tgg tgt aaa tgg act ggt gtt cat aca aca gtc 
His Ser Phe Phe Tyr Trp Cys Lys Trp Thr Gly Val His Thr Thr Val 
65 70 75 80 

tat gga tat gaa aaa aca caa gtt gaa ggt ccg gee gta gtt att tgt 
Tyr Gly Tyr Glu Lys Thr Gin Val Glu Gly Pro Ala Val Val lie Cys 
85 90 95 

aat cat cag ggt tct etc gac att eta teg atg gca tea ate tgg ccg 
Asn His Gin Gly Ser Leu Asp lie Leu Ser Met Ala Ser lie Trp Pro 
100 105 110 

aag aat tgt gtt gta atg atg aaa cga att ctt gee tat gtt cca ttc 
Lys Asn Cys Val Val Met Met Lys Arg lie Leu Ala Tyr Val Pro Phe 
115 120 125 

ttc aat etc gga gee tac ttt tec aac aca ate ttc ate gat cga tat 
Phe Asn Leu Gly Ala Tyr Phe Ser Asn Thr lie Phe lie Asp Arg Tyr 
130 135 140 

aac cgt gaa cgt gcg atg get tea gtt gat tat tgt gca tct gaa atg 
Asn Arg Glu Arg Ala Met Ala Ser Val Asp Tyr Cys Ala Ser Glu' Met 
145 150 155 160 



aag aac aga aat ctt aaa ctt tgg gta ttt ccg gaa gga aca aga aat 
Lys Asn Arg Asn Leu Lys Leu Trp Val Phe Pro Glu Gly Thr Arg Asn 
165 170 175 
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cgt gaa gga ggg ttc att cca ttc aag aaa gga gca ttc aat att gca 576 
Arg Glu Gly Gly Phe lie Pro Phe Lys Lys Gly Ala Phe Asn lie Ala 
■ 180 185 190 

gtt cgt gcg cag att ccc att att cca gtt gta ttc tea gac tat egg 624 
Val Arg Ala Gin lie Pro lie lie Pro Val Val Phe Ser Asp Tyr Arg 
195 200 205 

gat ttc tac tea aag cca ggc cga tat ttc aag aat gat gga gaa gtt 672 
Asp Phe Tyr Ser Lys Pro Gly Arg Tyr Phe Lys Asn Asp Gly Glu Val 
210 215 220 

gtt att cga gtt ctg gat gcg att cca aca aaa ggg etc act ctt gat 720 
Val lie Arg Val Leu Asp Ala lie Pro Thr Lys Gly Leu Thr Leu Asp 
225 230 235 . 240 

gac gtc age gag ttg tct gat atg tgt egg gac gtt atg ttg gca gee 768 
Asp Val Ser Glu Leu Ser Asp Met Cys Arg Asp Val Met Leu Ala Ala 
245 250 255 

tat aag gaa gtt act eta gaa get cag caa cga aat gcg aca egg cgt 816 
Tyr Lys Glu Val Thr Leu Glu Ala Gin Gin Arg Asn Ala Thr Arg Arg 
260 . 265 270 

gga gaa aca aaa gac ggg aag aaa tct gag taa 849 
Gly Glu Thr Lys Asp Gly Lys Lys Ser Glu 
275 280 



<210> 8 

<211> 282 

<212> PRT 

<213> Caenorhabditis elegans 

<400> 8 

Met Glu Asn Phe Trp Ser lie Val Val Phe Phe Leu Leu Ser lie Leu 
1 5 .10 15 



Phe lie Leu Tyr Asn lie Ser Thr Val Cys His Tyr Tyr Met Arg lie 
20 25 30 



Ser Phe Tyr Tyr Phe Thr lie Leu Leu His Gly Met Glu Val Cys Val 
35 40 45 



Thr Met lie Pro Ser Trp Leu Asn Gly Lys Gly Ala Asp Tyr Val Phe 

50 55 . 60 



His Ser Phe Phe Tyr Trp Cys Lys Trp Thr Gly Val His Thr Thr Val 
65 70 75 80 
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Tyr Gly Tyr Glu Lys Thr Gin Val Glu Gly Pro Ala Val Val lie Cys 
85 90 • 95 

Asn His Gin Gly Ser Leu Asp lie Leu Ser Met Ala Ser lie Trp Pro 
100 105 110 

Lys Asn Cys Val Val Met Met Lys Arg lie Leu Ala Tyr Val Pro Phe 
115 120 125 

Phe Asn Leu Gly Ala Tyr Phe Ser Asn Thr lie Phe lie Asp Arg Tyr 
130 135 140 



Asn Arg Glu Arg Ala Met Ala Ser Val Asp Tyr Cys Ala Ser Glu Met 
145 150 155 160 



Lys Asn Arg Asn Leu Lys Leu Trp Val Phe Pro Glu Gly Thr Arg Asn 
165 170 175 



Arg Glu Gly Gly Phe lie Pro Phe Lys Lys Gly Ala Phe Asn lie Ala 
180 185 • 190 



Val Arg Ala Gin lie Pro lie lie Pro Val Val Phe Ser Asp Tyr Arg 
195 200 205 



Asp Phe Tyr Ser Lys Pro Gly Arg Tyr Phe Lys Asn Asp Gly Glu Val 
210 215 220 



Val lie Arg Val Leu Asp Ala lie Pro Thr Lys Gly Leu Thr Leu Asp 
225 230 235 240 



Asp Val Ser Glu Leu Ser Asp Met Cys Arg Asp Val Met Leu Ala Ala 
245 250 255 



Tyr Lys Glu Val Thr Leu Glu Ala Gin Gin Arg Asn Ala Thr Arg Arg 
260 265 270 



Gly Glu Thr Lys Asp Gly Lys Lys Ser Glu 
275 280 



<210> 9 

<211> 1578 

<212> DNA 

<213> Physcomitrella patens 
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<220> 

<221> CDS 

<222> (1)..(1578) 

<223> Delta- 6-desaturase 



<400> 9 

atg gta ttc gcg ggc ggt gga ctt cag cag ggc tct etc gaa gaa aac 
Met Val Phe Ala Gly Gly Gly Leu Gin Gin Gly Ser Leu Glu Glu Asn 
1 5 10 .15 

ate gac gtc gag cac att gec agt atg tct etc ttc age gac ttc ttc 
lie Asp Val Glu His lie Ala Ser Met Ser Leu Phe Ser Asp Phe Phe 
20 25 30 

agt tat gtg tct tea act gtt ggt teg tgg age gta cac agt ata caa 
Ser Tyr Val Ser Ser Thr Val Gly Ser. Trp Ser Val His Ser lie Gin 
35 40 45 

pet ttg aag cgc ctg acg agt aag aag cgt gtt teg gaa age get gec 
Pro Leu Lys Arg Leu Thr Ser Lys Lys Arg Val Ser Glu Ser Ala Ala 
50 55 60 

gtg caa tgt ata tea get gaa gtt cag aga aat teg agt ace cag gga 
Val Gin Cys He Ser Ala Glu Val Gin Arg Asn Ser Ser Thr Gin Gly 
65 70 75 80 

act gcg gag gca etc gca gaa tea gtc gtg aag ccc acg aga cga agg 
Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys Pro Thr Arg Arg Arg 
85 90 95 

tea tct cag tgg aag aag teg aca cac ccc eta tea gaa gta gca gta 
Ser Ser Gin Trp Lys Lys Ser Thr His Pro Leu Ser Glu Val Ala Val 
100 105 110 

cac aac aag cca age gat tgc tgg att gtt gta aaa aac aag gtg tat 
His Asn Lys Pro Ser Asp Cys Trp He Val Val Lys Asn Lys Val Tyr 
115 120 125 

gat gtt tec aat ttt gcg gac gag cat ccc gga gga tea gtt att agt 
Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly Gly Ser Val He Ser 
130 135 140 

act tat ttt gga cga gac ggc aca gat gtt ttc tct agt ttt cat gca 
Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe Ser Ser Phe His Ala 
145 150 155 160 

get tct aca tgg aaa att ctt caa gac ttt tac att ggt gac gtg gag 
Ala Ser Thr Trp Lys He Leu Gin Asp Phe Tyr lie Gly Asp Val Glu 
165 170 175 

agg gtg gag ccg act cca gag ctg ctg aaa gat ttc cga gaa atg aga 
Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp Phe Arg Glu Met Arg 
180 185 190 



get ctt ttc ctg agg gag caa ctt ttc aaa agt teg aaa ttg tac tat 
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Ala Leu Phe Leu Arg Glu Gin Leu Phe Lys Ser Ser Lys Leu Tyr Tyr 
195 200 205 

gtt atg aag ctg .etc acg aat gtt get att ttt get gcg age att gca 672 
Val Met Lys Leu Leu Thr Asn Val Ala lie Phe Ala Ala Ser He Ala 
210 215 220 

ata ata tgt tgg age aag act att tea gcg gtt ttg get tea get tgt 720 
He He Cys Trp Ser Lys Thr He Ser Ala Val Leu Ala Ser Ala Cys 
225 230 235. L 240 

atg atg get ctg tgt ttc caa cag tgc gga tgg eta tec cat gat ttt 7 68 

Met Met Ala Leu Cys Phe Gin Gin Cys Gly Trp Leu Ser His Asp Phe 
245 250 255 

etc cac aat cag gtg ttt gag aca cgc tgg ctt aat gaa gtt gtc ggg 816 
Leu His Asn Gin Val Phe Glu Thr Arg Trp Leu Asn Glu Val Val Gly 
260 265 270 

tat gtg ate ggc aac gee gtt ctg ggg ttt agt aca ggg tgg tgg aag 864 
Tyr Val He Gly Asn Ala Val Leu Gly Phe Ser Thr Gly Trp Trp Lys 
275 280 ; 285 

gag aag cat aac ctt cat cat get get cca aat gaa tgc gat cag act 912 
Glu Lys His Asn Leu His His Ala Ala Pro Asn Glu Cys Asp Gin Thr 
290 295 300 

tac caa cca att gat gaa gat att gat act etc ccc etc att gee tgg 960 
Tyr Gin Pro lie Asp Glu Asp He Asp Thr Leu Pro Leu He Ala Trp 
305 310 315 320 

age aag gac ata ctg gee aca gtt gag aat aag aca ttc ttg cga ate 1008 
Ser Lys Asp He Leu Ala Thr Val Glu Asn Lys Thr Phe Leu Arg He 
325 330 335 

etc caa tac cag cat ctg ttc ttc atg ggt ctg tta ttt ttc gee cgt 1056 
Leu Gin Tyr Gin His Leu Phe Phe Met Gly Leu Leu Phe Phe Ala Arg 
340 345 350 

ggt agt tgg etc ttt tgg age tgg aga tat ace tct aca gca gtg etc 1104 
Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu 
355 360 365 

tea cct gtc gac agg ttg ttg gag aag gga act gtt ctg ttt cac tac 1152 
Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr Val Leu Phe His Tyr 
370 375 380 

ttt tgg ttc gtc ggg aca gcg tgc tat ctt etc cct ggt tgg aag cca 1200 
Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro 
385 390 395 400 

tta gta tgg atg gcg gtg act gag etc atg ccc ggc atg ctg ctg ggc 1248 
Leu Val Trp Met Ala Val Thr Glu Leu Met Ser Gly Met Leu Leu Gly 
405 410 415 

ttt gta ttt gta ctt age cac aat ggg atg gag gtt tat aat teg tct 1296 
Phe Val Phe Val Leu Ser His Asn Gly Met Glu Val Tyr Asn Ser Ser 



i. 



15 



420 425 430 

aaa gaa ttc gtg agt gca cag ate gta tec aca egg gat ate aaa gga 
Lys Glu Phe Val Ser Ala Gin lie Val Ser Thr Arg Asp lie Lys Gly 
435 440 445 

aac ata ttc aac.gac tgg ttc act ggt ggc ctt aac agg caa ata gag 
Asn lie Phe Asn Asp Trp Phe Thr Gly Gly Leu Asn Arg Gin lie Glu 
450 455 460 

cat cat ctt ttc cca aca atg ccc agg cat aat tta aac aaa ata gca 
His His Leu Phe Pro. Thr Met Pro Arg His Asn Leu Asn Lys lie Ala 
465 470 475 480 

cct aga gtg gag gtg ttc tgt aag aaa cac ggt ctg gtg tac gaa gac 
Pro Arg Val Glu Val Phe Cys Lys Lys His Gly Leu Val Tyr Glu Asp 
485 490 495 

gta tct att get ace ggc act tgc aag gtt ttg aaa gca ttg aag gaa 
Val Ser lie Ala Thr Gly Thr Cys Lys Val Leu Lys Ala Leu Lys Glu 
500 505 510 



gtc gcg gag get gcg gca gag cag cat get acc acc agt taa 
Val Ala Glu Ala Ala Ala Glu Gin His Ala Thr Thr Ser 
515 520 525 



<210> 10 

<211> 525 

<212> PRT 

<213> Physcomitrella patens 

<400> 10 

Met Val Phe Ala Gly Gly Gly Leu Gin Gin Gly Ser Leu Glu Glu Asn 
1 5 10 15 



lie Asp Val Glu His lie Ala Ser Met Ser Leu Phe Ser Asp Phe Phe 
20 25 30 



Ser. Tyr Val Ser Ser Thr Val Gly 
35 40 



Pro Leu Lys Arg Leu Thr Ser Lys 
50 55 



Val Gin Cys lie Ser Ala Glu Val 

65 .70 



Ser Trp Ser Val His Ser lie Gin 
45 



Lys Arg Val Ser Glu Ser Ala Ala 
60 



Gin Arg Asn Ser Ser Thr Gin Gly 
75 80 



Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys Pro Thr Arg Arg Arg 
85 90 95 
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Ser Ser Gin Trp Lys Lys Ser Thr His Pro Leu Ser Glu Val Ala Val 
100 105 110 



His Asn Lys Pro Ser Asp Cys Trp lie Val Val Lys Asn Lys Val Tyr 
115 120 125 



Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly Gly Ser Val lie Ser 
130 135 140 



Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe Ser Ser Phe His Ala 
145 150 155 160 



Ala Ser Thr Trp Lys lie Leu Gin Asp Phe Tyr lie Gly Asp Val Glu 
165 170 175 



Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp Phe Arg Glu Met Arg 
180 185 190 



Ala Leu Phe Leu Arg Glu Gin Leu Phe Lys Ser Ser Lys Leu Tyr Tyr 
195 200 ■ 205 



Val Met Lys Leu Leu Thr Asn Val Ala He Phe Ala Ala Ser He Ala 
210 215 220 



He He Cys Trp Ser Lys Thr He Ser Ala Val Leu Ala Ser Ala Cys 
225 230 235 240 



Met Met Ala Leu Cys Phe Gin Gin Cys Gly Trp Leu Ser His Asp Phe 
245 250 255 



Leu His Asn Gin Val Phe Glu Thr Arg Trp Leu Asn Glu Val Val Gly 
260 265 270 

Tyr Val He Gly Asn Ala Val Leu Gly Phe Ser Thr Gly Trp Trp Lys 
275 280 285 

Glu Lys His Asn Leu His His Ala Ala Pro Asn Glu Cys Asp Gin Thr 
290 295 300 



Tyr Gin Pro He Asp Glu Asp He Asp Thr Leu Pro Leu He Ala Trp 
305 310 315 320 
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Ser Lys Asp lie Leu Ala Thr Val Glu Asn Lys Thr Phe Leu Arg lie 
.325 330 335 



Leu Gin Tyr Gin His Leu Phe Phe Met Gly Leu Leu Phe Phe Ala Arg 
340 345 350 



Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu 
355 360 365 



Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr Val Leu Phe His Tyr 
370 375 380 



Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro 
385 390 395 400 



Leu Val Trp Met Ala Val Thr Glu Leu Met Ser Gly Met Leu Leu Gly 
405 410 415 



Phe Val Phe Val Leu Ser His Asn Gly Met Glu Val Tyr Asn Ser Ser 
420 425 430 



Lys Glu Phe Val Ser Ala Gin lie Val Ser Thr Arg Asp lie Lys Gly 
435 440 445 



Asn lie Phe Asn Asp Trp Phe Thr Gly Gly Leu Asn Arg Gin lie Glu 
450 455 460 



His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Lys lie Ala 
465 470 475 480 



Pro Arg Val Glu Val Phe Cys Lys Lys His Gly Leu Val Tyr Glu Asp 
485 490 495 



Val Ser lie Ala Thr Gly Thr Cys Lys Val Leu Lys Ala Leu Lys Glu 
500 505 510 



Val Ala Glu Ala Ala Ala Glu Gin His Ala Thr Thr Ser 
515 520 525 



<210> 11 

<211> 1192 

<212> DNA 

<213> Physcomitrella patens 
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<220> 

<221> CDS 

<222> (58).. (930) 

<223> Delta-6-elongase 



<400> 11 

ctgcttcgtc tcatcttggg ggtgtgattc gggagtgggt tgagttggtg gagcgca 57 

atg gag gtc gtg gag aga ttc tac ggt gag ttg gat ggg aag gtc teg 105 
Met Glu Val Val Glu Arg Phe Tyr Gly Glu Leu Asp Gly Lys Val Ser 
15 10 15 

cag ggc gtg aat gca ttg ctg ggt agt ttt ggg gtg gag ttg acg gat 153 
Gin Gly Val Asn Ala Leu Leu Gly Ser Phe Gly Val Glu Leu Thr Asp 
20 25 30 

acg ccc act acc aaa ggc ttg ccc etc gtt gac agt ccc aca ccc ate 201 
Thr Pro Thr Thr Lys Gly Leu Pro Leu Val Asp Ser Pro Thr Pro lie 
35 40 45 

gtc etc ggt gtt tct gta tac ttg act att gtc att gga ggg ctt ttg 249 
Val Leu Gly Val Ser Val Tyr Leu Thr He Val lie Gly Gly Leu Leu 
50 55 60 

tgg ata aag gec agg gat ctg aaa ccg cgc gee teg gag cca ttt ttg 297 
Trp He Lys Ala Arg Asp Leu Lys Pro Arg Ala* Ser Glu Pro Phe Leu 
65 70 75 80 

etc caa get ttg gtg ctt gtg cac aac ctg ttc tgt ttt gcg etc agt 345 
Leu Gin Ala Leu Val Leu Val His Asn Leu Phe Cys Phe Ala Leu Ser 
85 90 95 

ctg tat atg tgc gtg ggc ate get tat cag get att acc tgg egg tac 393 
Leu Tyr Met Cys Val Gly He Ala Tyr Gin Ala He Thr Trp Arg Tyr 
100 105 110 

tct etc tgg ggc aat gca tac aat cct aaa cat aaa gag atg gcg att 441 
Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys His Lys Glu Met Ala He 
115 120 125 

ctg gta tac ttg ttc tac atg tct aag tac gtg gaa ttc atg gat acc 489 
Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr Val Glu Phe Met Asp Thr 
130 135 140 

gtt ate atg ata ctg aag cgc age acc agg caa ata age ttc etc cac 537 
Val He Met He Leu Lys Arg Ser Thr Arg Gin He Ser Phe Leu His 
145 150 155 160 

gtt tat cat cat tct tea att tec etc att tgg tgg get att get cat 585 
Val Tyr His His Ser Ser He Ser Leu He Trp Trp Ala He Ala His 
165 170 175 



cac get cct ggc ggt gaa gca tat tgg tct gcg get ctg aac tea gga 
His Ala Pro Gly Gly Glu Ala Tyr Trp Ser Ala Ala Leu Asn Ser Gly 
180 185 190 



633 
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gtg cat gtt etc atg tat gcg tat tac ttc ttg get gec tgc ctt cga 681 
Val His Val Leu Met Tyr Ala Tyr Tyr Phe Leu Ala Ala Cys Leu Arg 
195 200 205 

agt age cca aag tta aaa aat aag tac ctt ttt tgg ggc agg tac ttg 729 
Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu Phe Trp Gly Arg Tyr Leu 
210 . 215 220 

aca caa ttc caa atg ttc cag ttt atg ctg aac tta gtg cag get tac 777 
Thr Gin Phe Gin Met Phe Gin Phe Met Leu Asn Leu Val Gin Ala Tyr 
225 230 235 240 

tac gac atg aaa acg aat gcg cca tat cca caa tgg ctg ate aag att 825 
Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro Gin Trp Leu lie Lys lie 
245 250 255 

ttg ttc tac tac atg ate teg ttg ctg ttt ctt ttc ggc aat ttt tac 873 
Leu Phe Tyr Tyr Met lie Ser Leu Leu . Phe Leu Phe Gly Asn Phe Tyr 
260 265 270 

gta caa aaa tac ate aaa ccc tct gac gga aag caa aag gga get aaa 921 
Val Gin Lys Tyr lie Lys Pro Ser Asp Gly Lys Gin Lys Gly Ala Lys 
275 280 285 

act gag tga gctgtatcaa gecatagaaa ctctattatg ttagaacctg 970 
Thr Glu 
290 

aagttggtgc tttcttatct ccacttatct tttaagcagc atcagttttg aaatgatgtg 1030 

tgggcgtggt ctgeaagtag tcatcaatat aatcggcctg agcacttcag atggattgtt 1090 

agaacatgag taaaagcggt tattaeggtg tttattttgt accaaatcac cgcacgggtg 1150 

aattgaaata tttcagattt gatcaatttc atctgaaaaa aa 1192 



<210> 12 
<211> 290 
<212> PRT 

<213> Physcomitrella patens 
<400> 12 

Met Glu Val Val Glu Arg Phe Tyr Gly Glu Leu Asp Gly Lys Val Ser 
15 10 15 



Gin Gly Val Asn Ala Leu Leu Gly Ser Phe Gly Val Glu Leu Thr Asp 
20 25 30 



Thr Pro Thr Thr Lys Gly Leu Pro Leu Val Asp Ser Pro Thr Pro lie 
35 .40 45 



Val Leu Gly Val Ser Val Tyr Leu Thr He Val He Gly Gly Leu Leu 



20 



50 



55 



60 



Trp. lie Lys Ala Arg Asp Leu Lys Pro Arg Ala Ser Glu Pro Phe Leu 
65 70 75 80. 



Leu Gin Ala Leu Val Leu Val His Asn Leu Phe Cys Phe Ala Leu Ser 
85 90 95 



Leu Tyr Met Cys Val Gly lie Ala Tyr Gin Ala lie Thr Trp Arg Tyr 
100 105 110 



Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys His Lys Glu Met Ala lie 
115 120 125 



Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr Val Glu Phe Met Asp Thr 
130 135 140 



Val lie Met lie Leu Lys Arg Ser Thr Arg Gin lie Ser Phe Leu His 
145 150 155 160 



Val Tyr His His Ser Ser lie Ser Leu lie Trp Trp Ala lie Ala His 
165 170 175 



His Ala Pro Gly Gly Glu Ala Tyr Trp Ser Ala Ala Leu Asn Ser Gly 
180 185 190 



Val His Val Leu Met Tyr Ala Tyr Tyr Phe Leu Ala Ala Cys Leu Arg 
195 200 205 



Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu Phe Trp Gly Arg Tyr Leu 
210 215 220 



Thr Gin Phe Gin Met Phe Gin Phe Met Leu Asn Leu Val Gin Ala Tyr 
225 230 235 240 



Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro Gin Trp Leu lie Lys lie 
245 250 255 



Leu Phe Tyr Tyr Met lie Ser Leu Leu Phe Leu Phe Gly Asn Phe Tyr 
260 265 270 



Val Gin Lys Tyr lie Lys Pro Ser Asp Gly Lys Gin Lys Gly Ala Lys 
275 280 285 
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Thr Glu 
290 



<210> 13 
<211> 1410 



<212> DNA 

<213> Phaeodactylum tricornutum 



<220> 

<221> CDS 

<222> (1) . . (1410) 

<223> Delta-5-desaturase 



<400> 13 

atg get ccg gat gcg gat aag ctt cga caa cgc cag acg act gcg gta 

Met Ala Pro Asp Ala Asp Lys Leu Arg Gin Arg Gin Thr Thr- Ala Val 
15 10 15 



gcg aag cac aat get get ace ata teg acg cag gaa cgc ctt tgc agt 
Ala Lys His Asn Ala Ala Thr lie Ser Thr Gin Glu Arg Leu Cys Ser 
20 25 30 

ctg tct teg etc aaa ggc gaa gaa gtc tgc ate gac gga ate ate tat 
Leu Ser Ser Leu Lys Gly Glu Glu Val Cys lie Asp Gly lie lie Tyr 
35 40 45 

gac etc caa tea ttc gat cat ccc ggg ggt gaa acg ate aaa atg ttt 
Asp Leu Gin Ser Phe Asp His Pro Gly Gly Glu Thr lie Lys Met Phe 
50 . 55 60 

ggt ggc aac gat gtc act gta cag tac aag atg att cac ccg tac cat 
Gly Gly Asn Asp Val Thr Val Gin Tyr Lys Met lie His Pro Tyr His 
65 70 75 80 

ace gag aag cat ttg gaa aag atg aag cgt gtc ggc aag gtg acg gat 
Thr Glu Lys His Leu Glu Lys Met Lys Arg Val Gly Lys Val Thr Asp 
85 90 95 

ttc gtc tgc gag tac aag ttc gat ace gaa ttt gaa cgc gaa ate aaa 
Phe Val Cys Glu Tyr Lys Phe Asp Thr Glu Phe Glu Arg Glu lie Lys 
100 105 110 



cga gaa gtc ttc aag att gtg cga cga ggc aag gat ttc ggt act ttg 
Arg Glu Val Phe Lys lie Val Arg Arg Gly Lys Asp Phe Gly Thr Leu 
115 120 125 

gga tgg ttc ttc cgt gcg ttt .tgc tac att gec att ttc ttc tac ctg 
Gly Trp Phe Phe Arg Ala Phe Cys Tyr lie Ala lie Phe Phe Tyr Leu 
130 135 . 140 



cag tac cat tgg gtc ace acg gga ace tct tgg ctg ctg gee gtg gee 
Gin Tyr His Trp Val Thr Thr Gly Thr Ser Trp Leu Leu Ala Val Ala 
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145 150 155 160 

tac gga ate tec caa gcg atg att ggc atg aat gtc cag cac gat gec 

Tyr.Gly He Ser Gin Ala Met He Gly Met Asn Val Gin His Asp Ala 

165 170 175 



528 



aac cac ggg gec acc tec aag cgt ccc tgg gtc aac gac atg eta ggc 576 
Asn His Gly Ala Thr Ser Lys Arg Pro Trp Val Asn Asp Met Leu Gly 
180 185 190 

etc ggt gcg gat ttt att ggt ggt tec aag tgg etc tgg cag gaa caa 624 
Leu Gly Ala Asp Phe He Gly Gly Ser Lys Trp Leu Trp Gin Glu Gin 
195 200 205 

cac tgg acc cac cac get tac acc aat cac gee gag atg gat ccc gat 672 
His Trp Thr His His Ala Tyr Thr Asn His Ala Glu Met Asp Pro Asp 
210 215 220 

age ttt ggt gee gaa cca atg etc eta ttc aac gac tat ccc ttg gat ' 72 0 
Ser Phe Gly Ala Glu Pro Met Leu Leu Phe Asn Asp Tyr Pro Leu Asp 
225 230 235 240 

cat ccc get cgt acc tgg eta cat cgc ttt caa gca ttc ttt tac atg 768 
His Pro Ala Arg Thr Trp Leu His Arg Phe Gin Ala Phe Phe Tyr Met 
245 250 255 

ccc gtc ttg get gga tac tgg ttg tec get gtc 'ttc aat cca caa att 816 
Pro Val Leu Ala Gly Tyr Trp Leu Ser Ala Val Phe Asn Pro Gin He 
260 265 270 

ctt gac etc cag caa cgc ggc gca ctt tec gtc ggt ate cgt etc gac 864 
Leu Asp Leu Gin Gin Arg Gly Ala Leu Ser Val Gly He Arg Leu Asp 
275 280 285 

aac get ttc att cac teg cga cgc aag tat gcg gtt ttc tgg egg get 912 
Asn Ala Phe He His Ser Arg Arg Lys Tyr Ala Val Phe Trp Arg Ala 
290 295 300 

gtg tac att gcg gtg aac gtg att get ccg ttt tac aca aac tec ggc 960 
Val Tyr He Ala Val Asn Val He Ala Pro Phe Tyr Thr Asn Ser Gly 
305 310 315 320 

etc gaa tgg tec tgg cgt gtc ttt gga aac ate atg etc atg ggt gtg 1008 
Leu Glu Trp Ser Trp Arg Val Phe Gly Asn He Met Leu Met Gly Val 
325 330 335 

gcg gaa teg etc gcg ctg gcg gtc ctg ttt teg ttg teg cac aat ttc 1056 
Ala Glu Ser Leu Ala Leu Ala Val Leu Phe Ser Leu Ser His Asn Phe 
340 345 350 

gaa tec gcg gat cgc gat ccg acc gee cca ctg aaa aag acg gga gaa 1104 
Glu Ser Ala Asp Arg Asp Pro Thr Ala Pro Leu Lys Lys Thr Gly Glu 
355 360 365 

cca gtc gac tgg ttc aag aca cag gtc gaa act tec tgc act tac ggt 1152 
Pro Val Asp Trp Phe Lys Thr Gin Val Glu Thr Ser Cys Thr Tyr Gly 
370 375 380 
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gga ttc ctt tec ggt tgc ttc acg gga ggt etc aac ttt cag gtt gaa 1200 
Gly Phe Leu Ser Gly Cys Phe Thr Gly Gly Leu Asn Phe Gin Val Glu 
385 390 395 400 

cac cac ttg ttc cca cgc atg age age get tgg tat ccc tac att gec 1248 
His His Leu Phe Pro Arg Met Ser Ser Ala Trp Tyr Pro Tyr lie Ala 
405 410 415 

ccc aag gtc cgc gaa att tgc gee aaa cac ggc gtc cac tac gee tac 1296 
Pro Lys Val Arg Glu lie Cys Ala Lys His Gly Val His Tyr Ala Tyr 
420 425 430 

tac ccg tgg ate cac caa aac ttt etc tec ace gtc cgc tac atg cac 1344 
Tyr Pro Trp lie His Gin Asn Phe Leu Ser Thr Val Arg Tyr Met His 
435 440 445 

gcg gee ggg acc ggt gee aac tgg cgc cag atg gee aga gaa aat ccc 1392 
Ala Ala Gly Thr Gly Ala Asn Trp Arg Gin Met Ala Arg Glu Asn Pro 
450 455 460 

ttg acc gga egg gcg taa 1410 

Leu Thr Gly Arg Ala 

465 



<210> 14 
<211> 469 
<212> PRT 

<213> Phaeodactylum tricornutum 
<400> 14 

Met Ala Pro Asp Ala Asp Lys Leu Arg Gin Arg Gin Thr Thr Ala Val 
15 10 15 



Ala Lys His Asn Ala Ala Thr lie Ser Thr Gin Glu Arg Leu Cys Ser 
20 . 25 30 



Leu Ser Ser Leu Lys Gly Glu Glu Val Cys lie Asp Gly He He Tyr 
35 40 45 



Asp Leu Gin Ser Phe Asp His Pro Gly Gly Glu Thr He Lys Met Phe 
50 55 60 



Gly Gly Asn Asp Val Thr Val Gin Tyr Lys Met He His Pro Tyr His 
65 70 75 80 



Thr Glu Lys His Leu Glu Lys Met Lys Arg Val Gly Lys Val Thr Asp 
85 90 95 
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Phe Val Cys Glu Tyr Lys Phe Asp Thr Glu Phe Glu Arg Glu lie Lys 
100 105 110 



Arg Glu Val. Phe Lys lie Val Arg Arg Gly Lys Asp Phe Gly Thr Leu 
115 120 125 



Gly Trp Phe Phe Arg Ala Phe Cys Tyr lie Ala lie Phe Phe Tyr Leu 
130 135 140 



Gin Tyr His Trp Val Thr Thr Gly Thr Ser Trp Leu Leu Ala Val Ala 
145 150 155 160 



Tyr Gly He Ser Gin Ala Met He Gly Met Asn Val Gin His Asp Ala 
165 170 175 



Asn His Gly Ala Thr Ser Lys Arg Pro Trp Val Asn Asp Met Leu Gly 
180 185 190 



Leu Gly Ala Asp Phe He Gly Gly Ser Lys Trp Leu Trp Gin Glu Gin 
195 200 205 



His Trp Thr His His Ala Tyr Thr Asn His Ala Glu Met Asp Pro Asp 
210 215 220 



Ser Phe Gly Ala Glu Pro Met Leu Leu Phe Asn Asp Tyr Pro Leu Asp 
225 230 235 240 



His Pro Ala Arg Thr Trp Leu His Arg Phe Gin Ala Phe Phe Tyr Met 
245 250 255 



Pro Val Leu Ala Gly Tyr Trp Leu Ser Ala Val Phe Asn Pro Gin He 
260 265 270 



Leu Asp Leu Gin Gin Arg Gly Ala Leu Ser Val Gly He Arg Leu Asp 
275 280 285 



Asn Ala Phe He His Ser Arg Arg Lys Tyr Ala Val Phe Trp Arg Ala 
290 295 300 



Val Tyr He Ala Val Asn Val He Ala Pro Phe Tyr Thr Asn Ser Gly 
305 310 315 320 



Leu Glu Trp Ser Trp Arg Val Phe Gly Asn He Met Leu Met Gly Val 
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325 



330 



335 



Ala Glu Ser Leu Ala Leu Ala Val Leu Phe Ser Leu Ser His Asn Phe 
340 345 350 



Glu Ser Ala Asp Arg Asp Pro Thr 
355 360 



Pro Val Asp Trp Phe Lys Thr Gin 
370 375 



Gly Phe Leu Ser Gly Cys Phe Thr 
385 390 



His His Leu Phe Pro Arg Met Ser 
405 



Ala Pro Leu Lys Lys Thr Gly Glu 
365 

Val Glu Thr Ser Cys Thr Tyr Gly 
380 

Gly Gly Leu Asn Phe Gin Val Glu 
395 400 

Ser Ala Trp Tyr Pro Tyr lie Ala 
410 415 



Pro Lys Val Arg Glu lie Cys Ala Lys His Gly Val His Tyr Ala Tyr 
420 425 430 



Tyr Pro Trp lie His Gin Asn Phe Leu Ser Thr Val Arg Tyr Met His 
435 440 445 



Ala Ala Gly Thr Gly Ala Asn Trp Arg Gin Met Ala Arg Glu Asn Pro 
450 455 460 



Leu Thr Gly Arg Ala 
465 



<210> 15 

<211> 3598 

<212> DNA 

<213> artificial sequence 
<220> 

<221> misc_f eature 

<222> (1)..(3598) 

<223> The sequence is a plant promoter- terminator expression cassette in vector 
PUC19 



<400> 15 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
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ttggcgggtg 


tcggggctgg 


cttaactatg 


cggcatcaga 


gcagattgta 


ctgagagtgc 


180 


accatatgcg 


gtgtgaaata 


ccgcacagat 


gcgtaaggag 


aaaa.taccgc 


atcaggcgcc 


240 


attcgccatt 


caggctgcgc 


aactgttggg 


aagggcgatc 


ggtgcgggcc 


tcttcgctat 


300 


tacgccagct 


ggcgaaaggg 


ggatgtgctg 


caaggcgatt 


aagttgggta 


acgccagggt 


360 


tttcccagtc 


acgacgttgt 


aaaacgacgg 


ccagtgaatt 


cggcgcgccg 


agctcctcga 


420 


gcaaatttac 


acattgccac 


taaacgtcta 


aacccttgta 


atttgttttt 


gttttactat 


480 


gtgtgttatg 


tatttgattt 


gcgataaatt 


tttatatttg 


gtactaaatt 


tataacacct 


540 


tttatgctaa 


cgtttgccaa 


cacttagcaa 


tttgcaagtt 


gattaattga 


ttctaaatta 


600 


tttttgtctt 


ctaaatacat 


atactaatca 


actggaaatg 


taaatatttg 


ctaatatttc 


660 


tactatagga 


gaattaaagt 


gagtgaatat 


ggtaccacaa 


ggtttggaga 


tttaattgtt 


720 


gcaatgctgc 


atggatggca 


tatacaccaa 


acattcaata 


attcttgagg 


ataataatgg 


780 


taccacacaa 


gatttgaggt 


gcatgaacgt 


cacgtggaca 


aaaggtttag 


taatttttca 


840 


agacaacaat 


gttaccacac 


acaagttttg 


aggtgcatgc 


atggatgccc 


tgtggaaagt 


900 


ttaaaaatat 


tttggaaatg 


atttgcatgg 


aagccatgtg* 


taaaaccatg 


acatccactt 


960 


ggaggatgca 


ataatgaaga 


aaactacaaa 


tttacatgca 


actagttatg 


catgtagtct 


1020 


atataatgag 


gattttgcaa 


tactttcatt 


catacacact 


cactaagttt 


tacacgatta 


1080 


taatttcttc 


atagccagcc 


caccgcggtg 


ggcggccgcc 


tgcagtctag 


aaggcctcct 


1140 


gctttaatga 


gatatgcgag 


acgcctatga 


tcgcatgata 


tttgctttca 


attctgttgt 


1200 


gcacgttgta 


aaaaacctga 


gcatgtgtag 


ctcagatcct 


taccgccggt 


ttcggttcat 


1260 


tctaatgaat 


atatcacccg 


ttactatcgt 


atttttatga 


ataatattct 


ccgttcaatt 


1320 


tactgattgt 


ccgtcgacga 


attcgagctc 


ggcgcgccaa 


gcttggcgta 


atcatggtca 


1380 


tagctgtttc 


ctgtgtgaaa 


ttgttatccg 


ctcacaattc 


cacacaacat 


acgagccgga 


1440 


agcataaagt 


gtaaagcctg 


gggtgcctaa 


tgagtgagct 


aactcacatt 


aattgcgttg 


1500 


cgctcactgc 


ccgctttcca 


gtcgggaaac 


ctgtcgtgcc 


agctgcatta 


atgaatcggc 


1560 


caacgcgcgg 


ggagaggcgg 


tttgcgtatt 


gggcgctctt 


ccgcttcctc 


gctcactgac 


1620 


tcgctgcgct 


cggtcgttcg 


gctgcggcga 


gcggtatcag 


ctcactcaaa 


ggcggtaata 


1680 


cggttatcca 


cagaatcagg 


ggataacgca 


ggaaagaaca 


tgtgagcaaa 


aggccagcaa 


1740 


aaggccagga 


accgtaaaaa 


ggccgcgttg 


ctggcgtttt 


tccataggct 


ccgcccccct 


1800 


gacgagcatc 


acaaaaatcg 


acgctcaagt 


cagaggtggc 


gaaacccgac 


aggactataa 


1860 
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agataccagg cgtttcccce tggaagctcc ctcgtgcgct ctcctgttcc gaccctgccg 1920 

cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttc tcatagctca 1980 

cgctgtaggt atctcagttc ggtgtaggtc gttcgctcca agctgggctg tgtgcacgaa 2040 

ccccccgttc agcccgaccg ctgcgcctta tccggtaact atcgtcttga gtccaacccg 2100 

gtaagacacg acttatcgcc actggcagca gccactggta acaggattag cagagcgagg 2160 

tatgtaggcg gtgctacaga gttcttgaag tggtggccta actacggcta cactagaagg 2220 

acagtatttg gtatctgcgc tctgctgaag ccagttacct tcggaaaaag agttggtagc 2280 

tcttgatccg gcaaacaaac caccgctggt agcggtggtt tttttgtttg caagcagcag 2340 

attacgcgca gaaaaaaagg atctcaagaa gatcctttga tcttttctac ggggtctgac 2400 

gctcagtgga acgaaaactc acgttaaggg attttggtca tgagattatc aaaaaggatc 2460 

ttcacctaga tccttttaaa ttaaaaatga agttttaaat caatctaaag tatatatgag 252 0 

taaacttggt ctgacagtta ccaatgctta atcagtgagg cacctatctc agcgatctgt 25.80 

ctatttcgtt catccatagt tgcctgactc cccgtcgtgt agataactac gatacgggag 2 640 

ggcttaccat ctggccccag tgctgcaatg ataccgcgag acccacgctc accggctcca 2700 

gatttatcag caataaacca gccagccgga agggccgagc gcagaagtgg tcctgcaact 2760 

ttatccgcct ccatccagtc tattaattgt tgccgggaag ctagagtaag tagttcgcca 2820 

gttaatagtt tgcgcaacgt tgttgccatt gctacaggca tcgtggtgtc acgctcgtcg 2 880 

tttggtatgg cttcattcag ctccggttcc caacgatcaa ggcgagttac atgatccccc 2940 

atgttgtgca aaaaagcggt tagctccttc ggtcctccga tcgttgtcag aagtaagttg 3000 

gccgcagtgt tatcactcat ggttatggca gcactgcata attctcttac tgtcatgcca 3060 

tccgtaagat gcttttctgt gactggtgag tactcaacca agtcattctg agaatagtgt 3120 

atgcggcgac cgagttgctc ttgcccggcg tcaatacggg ataataccgc gccacatagc 3180 

agaactttaa aagtgctcat cattggaaaa cgttcttcgg ggcgaaaact ctcaaggatc 3240 

ttaccgctgt tgagatccag ttcgatgtaa cccactcgtg cacccaactg atcttcagca 3300 

tcttttactt tcaccagcgt ttctgggtga gcaaaaacag gaaggcaaaa tgccgcaaaa 3360 

aagggaataa gggcgacacg gaaatgttga atactcatac tcttcctttt tcaatattat 3420 

tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa 3480 

aataaacaaa taggggttcc gcgcacattt ccccgaaaag tgccacctga cgtctaagaa 3540 
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accattatta tcatgacatt aacctataaa aataggcgta tcacgaggcc ctttcgtc 3 598 

<210> 16 

<211> 3590 

<212> DNA 

<213> artificial sequence 
<220> 

<221> misc_f eature 

<222> (1)..(3590) 

<223> The sequence is a plant promoter- terminator expression cassette in vector 
PUC19 

<400> 16 



tcgcgcgttt 


cggtgatgac 


ggtgaaaacc 


tctgacacat 


gcagctcccg 


gagacggtca 


60 


cagcttgtct 


gtaagcggat 


gccgggagca 


gacaagcccg 


tcagggcgcg 


tcagcgggtg 


120 


ttggcgggtg 


tcggggctgg 


cttaactatg 


cggcatcaga 


gcagattgta 


ctgagagtgc 


180 


accatatgcg 


gtgtgaaata 


ccgcacagat 


gcgtaaggag 


aaaataccgc 


atcaggcgcc 


240 


attcgccatt 


caggctgcgc 


aactgttggg 


aagggcgatc 


ggtgcgggcc 


tcttcgctat 


300 


tacgccagct 


ggcgaaaggg 


ggatgtgctg 


caaggcgatt 


aagttgggta 


acgccagggt 


360 


tttcccagtc 


acgacgttgt 


aaaacgacgg 


ccagtgaatt 


cggcgcgccg 


agctcctcga 


420 


gcaaatttac 


acattgccac 


taaacgtcta 


aacccttgta 


atttgttttt 


gttttactat 


480 


gtgtgttatg 


tatttgattt 


gcgataaatt 


tttatatttg 


gtactaaatt 


tataacacct 


540 


tttatgctaa 


cgtttgccaa 


cacttagcaa 


tttgcaagtt 


gattaattga 


ttctaaatta 


600 


tttttgtctt 


ctaaatacat 


atactaatca 


actggaaatg 


taaatatttg 


ctaatatttc 


660 


tactatagga 


gaattaaagt 


gagtgaatat 


ggtaccacaa 


ggtttggaga 


tttaattgtt 


720 


gcaatgctgc 


atggatggca 


tatacaccaa 


acattcaata 


attcttgagg 


ataataatgg 


780 


taccacacaa 


gatttgaggt 


gcatgaacgt 


cacgtggaca 


aaaggtttag 


taatttttca 


840 


agacaacaat 


gttaccacac 


acaagttttg 


aggtgcatgc 


atggatgccc 


tgtggaaagt 


900 


ttaaaaatat 


tttggaaatg 


atttgcatgg 


aagccatgtg 


taaaaccatg 


acatccactt 


960 


ggaggatgca 


ataatgaaga 


aaactacaaa 


tttacatgca 


actagttatg 


catgtagtct 


1020 


atataatgag 


gattttgcaa 


tactttcatt 


catacacact 


cactaagttt 


tacacgatta 


1080 


taatttcttc 


atagccagcg 


gatccgatat 


cgggcccgct 


agcgttaacc 


ctgctttaat 


1140 


gagatatgcg 


agacgcctat 


gatcgcatga 


tatttgcttt 


caattctgtt 


gtgcacgttg 


1200 
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taaaaaacct 


gagcatgtgt 


agctcagatc 


cccaccgccg 




4— +- +- ^ n |- /^r a 

dLLCLaaLya 


1ZDU 


atatatcacc 


cgt tactatc 


gcatccccac 


gaacaacacc 


ccccgcccaa 


ccuacLgauL. 


i 9 n 


gtccgtcgac 


gaattcgagc 


tcggcgcgcc 


aagc ttggcg 


caaccacggc 


cacagccgcc 


ijOU 


tcctgtgtga 


aattgttatc 


cgctcacaat 


tccacacaac 


acacgagccg 


gaagcacaaa 


1 AAfi 

± ^i^t U 


gtgtaaagcc 


tggggtgcct 


aatgagtgag 


ctaactcaca 


4— 4— T3 4— 4— 4— 

ccaaccgcgc 


cgcgcccacc 


IjUU 


gcccgctttc 


cagtcgggaa 


acctgtcgtg 


ccagctgcat 


caacgaaccg 


gccaacgcgc 


1 r £ n 

1DDU 


ggggagaggc 


ggtttgcgta 


ttgggcgctc 


ttccgcttcc 


ccgcccaccg 


--\ 4— i^r y"* 4~ y— * yr 

ac ccgccgcg 


IDZU 


ctcggtcgtt 


cggctgcggc 


gagcggtatc 


agctcactca 


aaggcggtaa 


tacggt tatc 


i ton 

J. D oU 


cacagaatca 


ggggataacg 


caggaaagaa 


catgtgagca 


aaaggecage 


aaaaggccag 


1 / 4 U 


gaaccgtaaa 


aaggccgcgt 


tgctggcgtt 


tttccatagg 


ctccgccccc 


ctgacgagca 


loUU 


tcacaaaaat 


cgacgctcaa 


gtcagaggtg 


gcgaaacccg 


acaggactat 


aaagatacca 


lobU 


ggcgtttccc 


cctggaagct 


ccctcgtgcg 


ctctcctgtt 


ccgaccctgc 


cgcttaccgg 


iyz u 


atacctgtcc 


gcctttctcc 


cttcgggaag 


cgtggcgctt 


tctcatagct 


caegctgtag 


1 q q n 


gtatctcagt 


tcggtgtagg 


tcgttcgctc 


caagctgggc 


tgtgtgcacg 


aaccccccgt 


OA/in 
z U4U 


tcagcccgac 


cgctgcgcct 


tatccggtaa 


ctaccgtct t 


gagtccaacc 


eggtaagaca 


o 1 no 
ZIUV 


cgacttatcg 


ccactggcag 


cagccactgg 


taacaggat t 


ageagagega 


ggtatgtagg 


9 1 £ n 


cggtgctaca 


gagttcttga 


agtggtggcc 


taactacggc 


tacactagaa 


ggacagtatt 


zzz U 


tggtatctgc 


gctctgctga 


agccagttac 


cttcggaaaa 


agagttggta 


gctcttgatc 


zz oU 


cggcaaacaa 


accaccgctg 


gtagcggtgg 


tttttttgtt 


tgeaagcage 


agattacgeg 


Z^4tU 


cagaaaaaaa 


ggatctcaag 


aagatccttt 


gacccL t tec 


aeggggtctg 


aege tcagtg 


Z4UU 


gaacgaaaac 


tcacgttaag 


ggatt ttggt 


cat gaga t ta 


tcaaaaagga 


cccccaccca 


Z 4t o u 


gatcctttta 


aattaaaaat 


gaagt tt taa 


atcaatctaa 


agtatatatg 


-^h f+x 4*" ^ ^ ^ 4~ 4~ y^r 

agcaaac Leg 


9 R9 n 

Z jZ U 


gtctgacagt 


taccaatgct 


taatcagtga 


ggcacccacc 


ccagcgaccc 


gLCL.acL.ccg 


neon 

Z Doll 


c ccatccaca 


gttgcctgac 


tccccgtcgt 


gtagataact 


aegataeggg 


y-< /*r f*>r y^ 4~ 4" ^ y*l y^i 

agggcccacc 


9 £/i n 

Z D4i U 


atctggcccc 


agtgctgcaa 


tgataccgcg 


agacccacgc 


tcaccggctc 


cagatttatc 


2700 


agcaataaac 


cagccagccg 


gaagggccga 


gcgcagaagt 


ggtcctgcaa 


ctttatccgc 


2760 


ctccatccag 


tctattaatt 


gttgccggga 


agctagagta 


agtagttcgc 


cagttaatag 


2820 


tttgcgcaac 


gttgttgcca 


ttgctacagg 


catcgtggtg 


tcacgctcgt 


cgtttggtat 


2880 
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ggcttcattc agctccggtt cccaacgatc aaggcgagtt acatgatccc ccatgttgtg 
caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc agaagtaagt tggccgcagt 
gttatcactc atggttatgg cagcactgca taattctctt actgtcatgc catccgtaag 
atgcttttct gtgactggtg agtactcaac caagtcattc tgagaatagt gtatgcggcg 
accgagttgc tcttgcccgg cgtcaatacg ggataatacc gcgccacata gcagaacttt 
aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa ctctcaagga tcttaccgct 
gttgagatcc agttcgatgt aacccactcg tgcacccaac tgatcttcag catcttttac 
tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa aatgccgcaa aaaagggaat 
aagggcgaca cggaaatgtt gaatactcat actcttcctt tttcaatatt attgaagcat 
ttatcagggt tattgtctca tgagcggata catatttgaa tgtatttaga aaaataaaca 
aataggggtt ccgcgcacat ttccccgaaa agtgccacct gacgtctaag aaaccattat 
tatcatgaca ttaacctata aaaataggcg tatcacgagg ccctttcgtc 

<210> 17 

<211> 3584 
<212> . DNA 

<213> artificial sequence 
<220> 

<221> misc_f eature 

<222> (1)..(3584) 

<223> The sequence is a plant promoter- terminator expression cassette in vector 
PUC19 

<400> 17 



tcgcgcgttt 


cggtgatgac 


ggtgaaaacc 


tctgacacat 


gcagctcccg 


gagacggtca 


60 


cagcttgtct 


gtaagcggat 


gccgggagca 


gacaagcccg 


tcagggcgcg 


tcagcgggtg 


120 


ttggcgggtg 


tcggggctgg 


cttaactatg 


cggcatcaga 


gcagattgta 


ctgagagtgc 


180 


accatatgcg 


gtgtgaaata 


ccgcacagat 


gcgtaaggag 


aaaataccgc 


atcaggcgcc 


240 


attcgccatt 


caggctgcgc 


aactgttggg 


aagggcgatc 


ggtgcgggcc 


tcttcgctat 


300 


tacgccagct 


ggcgaaaggg 


ggatgtgctg 


caaggcgatt 


aagttgggta 


acgccagggt 


360 


tttcecagtc 


acgacgttgt 


aaaacgacgg 


ccagtgaatt 


cggcgcgccg 


agctcctcga 


420 


gcaaatttac 


acattgccac 


taaacgtcta 


aacccttgta 


atttgttttt 


gttttactat 


480 


gtgtgttatg 


tatttgattt 


gcgataaatt 


tttatatttg 


gtactaaatt 


tataacacct 


540 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3590 
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tttatgctaa cgtttgccaa cacttagcaa tttgcaagtt gattaattga ttctaaatta 600 

tttttgtctt ctaaatacat atactaatca actggaaatg taaatatttg ctaatatttc 660 

tactatagga gaattaaagt gagtgaatat ggtaccacaa ggtttggaga tttaattgtt 720 

gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataataatgg 780 

taccacacaa gatttgaggt gcatgaacgt cacgtggaca aaaggtttag taatttttca 840 

agacaacaat gttaccacac acaagttttg aggtgcatgc atggatgccc tgtggaaagt 900 

ttaaaaatat tttggaaatg atttgcatgg aagccatgtg taaaaccatg acatccactt 960 

ggaggatgca ataatgaaga aaactacaaa tttacatgca actagttatg catgtagtct 1020 

atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 1080 

taatttcttc atagccagca gatctgccgg catcgatccc gggccatggc ctgctttaat 1140 

gagatatgcg agacgcctat gatcgcatga tatttgcttt caattctgtt gtgcacgttg 1200 

taaaaaacct gagcatgtgt agctcagatc cttaccgccg gtttcggttc attctaatga 1260 
atatatcacc cgttactatc gtatttttat gaataatatt ctccgttcaa tttactgatt . 1320 

gtccgtcgac gagctcggcg cgccaagctt ggcgtaatca tggtcatagc tgtttcctgt 1380 

gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa 1440 

agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct cactgcccgc 1500 

tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag 1560 

aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 1620 

cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 1680 

atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 1740 

taaaaaggcc gcgttgctgg cgtttt.tcca taggctccgc ccccctgacg agcatcacaa 1800 

aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 1860 

tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 1920 

gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 1980 

cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 2040 

cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 2100 

atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 2160 

tacagagttc ttgaagtggt ggcctaacta cggctacact agaaggacag tatttggtat 2220 

ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 22 80 
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acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta . cgcgcagaaa 2340 

aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga 2400 

aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct 2460 

tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga 2520 

cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc 2580 

catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg 2 640 

ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat 2700 

aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat 2760 

ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg 2820 

caacgttgtt. gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc 2880 

attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa 2940 

agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc 3000 

actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt 3060 

ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag 3120 

ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt 3180 

gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag 3240 

atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac 3300 

cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc 33 60 

gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca 3420 

gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg 3480 

ggttccgcgc acatttcccc gaaaagtgcc acctgacgtc taagaaacca ttattatcat 3 540 

gacattaacc tataaaaata ggcgtatcac gaggcccttt cgtc 3584 

<210> 18 
<211> 4507 
<212> DNA 

<213> artificial sequence 
<220> 

<221> misc_f eature 
<222> (1)..{4507) 



<223> 
PUC19 



The sequence is a plant promoter- terminator expression cassette in vector 



33 



<400> 18 

tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca 60 
cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg 120 
ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc 180 
accatatgcg gtgtgaaata ccgcacagat gcgtaaggag aaaataccgc atcaggcgcc 240 
attcgccatt caggctgcgc aactgttggg aagggcgatc ggtgcgggcc tcttcgctat 300 
tacgccagct ggcgaaaggg ggatgtg'ctg caaggcgatt aagttgggta acgccagggt 360 
tttcccagtc acgacgttgt aaaacgacgg ccagtgaatt cggcgcgccg agctcctcga 420 
gcaaatttac acattgccac taaacgtcta aacccttgta atttgttttt gttttactat 480 
gtgtgttatg tatttgattt gcgataaatt tttatatttg gtactaaatt tataacacct 540 
tttatgctaa cgtttgccaa cacttagcaa tttgcaagtt gattaattga ttctaaatta 6.00 
tttttgtctt ctaaatacat atactaatca actggaaatg taaatatttg ctaatatttc 660 
tactatagga gaattaaagt gagtgaatat ggtaccacaa ggtttggaga tttaattgtt 720 
gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataataatgg 780 
taccacacaa gatttgaggt gcatgaacgt cacgtggaca aaaggtttag taatttttca 840 
agacaacaat gttaccacac acaagttttg aggtgcatgc atggatgccc tgtggaaagt 900 
ttaaaaatat tttggaaatg atttgcatgg aagccatgtg taaaaccatg acatccactt 960 

ggaggatgca ataatgaaga aaactacaaa tttacatgca actagttatg catgtagtct 1020 

atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 1080 

taatttcttc atagccagcc caccgcggtg ggcggccgcc tgcagtctag aaggcctcct 1140 

gctttaatga gatatgcgag acgcctatga tcgcatgata tttgctttca attctgttgt 1200 

gcacgttgta aaaaacctga gcatgtgtag ctcagatcct taccgccggt ttcggttcat 1260 

tctaatgaat atatcacccg ttactatcgt atttttatga ataatattct ccgttcaatt 1320 

tactgattgt ccgtcgagca aatttacaca ttgccactaa acgtctaaac ccttgtaatt 1380' 

tgtttttgtt ttactatgtg tgtbatgtat ttgatttgcg ataaattttt atatttggta 1440 

ctaaatttat aacacctttt atgctaacgt ttgccaacac ttagcaattt gcaagttgat 1500 

taattgattc taaattattt ttgtcttcta aatacatata ctaatcaact ggaaatgtaa 1560 
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atatttgcta atatttctac tataggagaa ttaaagtgag tgaatatggt accacaaggt 1620 

ttggagattt aattgttgca atgctgcatg gatggcatat acac.caaaca ttcaataatt 1680 

cttgaggata ataatggtac cacacaagat ttga'ggtgca tgaacgtcac gtggacaaaa 1740 

ggtttagtaa tttttcaaga caacaatgtt accacacaca agttttgagg tgcatgcatg 1800 

gatgccctgt ggaaagttta aaaatatttt ggaaatgatt tgcatggaag ccatgtgtaa 1860 

aaccatgaca tccacttgga ggatgcaata atgaagaaaa ctacaaattt acatgcaact 192 0 

agttatgcat gtagtctata taatgaggat tttgcaatac tttcattcat acacactcac 1980 

taagttttac acgattataa tttcttcata gccagcggat ccgatatcgg gcccgctagc 2 040 

gttaaccctg ctttaatgag atatgcgaga cgcctatgat cgcatgatat ttgctttcaa 2100 

ttctgttgtg cacgttgtaa aaaacctgag catgtgtagc tcagatcctt accgccggtt 2160 

tcggttcatt ctaatgaata tatcacccgt tactatcgta tttttatgaa taatattctc 2220 

cgttcaattt actgattgtc cgtcgacgaa ttcgagctcg gcgcgccaag cttggcgtaa 22 80 

tcatggtcat agctgtttcc tgtgtgaaat tgttatccgc tcacaattcc acacaacata 2340 

cgagccggaa gcataaagtg taaagcctgg ggtgcctaat* gagtgagcta actcacatta 2400 

attgcgttgc gctcactgcc cgctttccag tcgggaaacc tgtcgtgcca gctgcattaa 2460 

tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg 2520 

ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag 2580 

gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa 2640 

ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc 2700 

cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca 27 60 

ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg 2 820 

accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct 2880 

catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt 2940 

gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag 3000 

tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc 3060 

agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtggcctaa ctacggctac 312 0 

actagaagga cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga 3180 

gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc 3240 

aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg 3300 
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gggtctgacg ctcagtggaa cgaaaactca cgttaaggga ttttggtcat gagattatca 3360 

aaaaggatct tcacctagat ccttttaaat taaaaatgaa gttttaaatc aatctaaagt 3420 

atatatgagt aaacttggtc tgacagttac caatgcttaa tcagtgaggc acctatctca 3480 

gcgatctgtc tatttcgttc atccatagtt gcctgactcc ccgtcgtgta gataactacg 3540 

atacgggagg gcttaccatc tggccccagt gctgcaatga taccgcgaga cccacgctca 3600 

ccggctccag atttatcagc aataaaccag ccagccggaa gggccgagcg cagaagtggt 3 660 

cctgcaactt tatccgcctc catccagtct attaattgtt gccgggaagc tagagtaagt 3720 

agttcgccag ttaatagttt gcgcaacgtt gttgccattg ctacaggcat cgtggtgtca 3780 

cgctcgtcgt ttggtatggc ttcattcagc tccggttccc aacgatcaag gcgagttaca 3840 

tgatccccca tgttgtgcaa aaaagcggtt agctccttcg gtcctccgat cgttgtcaga 3900 

agtaagttgg ccgcagtgtt atcactcatg gttatggcag cactgcataa ttctcttact 3960 

gtcatgccat ccgtaagatg cttttctgtg actggtgagt actcaaccaa gtcattctga 4020 

gaatagtgta tgcggcgacc gagttgctct tgcccggcgt caatacggga taataccgcg 4080 

ccacatagca gaactttaaa agtgctcatc attggaaaac gttcttcggg gcgaaaactc 4140 

tcaaggatct taccgctgtt gagatccagt tcgatgtaac ccactcgtgc acccaactga 4200 

tcttcagcat cttttacttt caccagcgtt tctgggtgag caaaaacagg aaggcaaaat 4260 

gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa tactcatact cttccttttt 4320 

caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat atttgaatgt 4380 

atttagaaaa ataaacaaat aggggttccg cgcacatttc cccgaaaagt gccacctgac 4440 

gtctaagaaa ccattattat catgacatta acctataaaa ataggcgtat cacgaggccc 4500 

tttcgtc 4507 

<210> 19 
<211> 17752 
<212> DNA 

<213> Phaeodactylum tricornutum, . Physcomitrella patens 
CDS 

(11543) (12415) 
Delta-6-elongase 



<220> 
<221> 



CDS 
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<222> (13313) . . (14890) 
<223> Delta-6-desaturase 

<220> 

<221> CDS 

<222> (15791) . . (17200) 
<223> Delta-5-desaturase 

<400>. 19 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 

gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 

tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 

ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240 

atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 

ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 

tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 42 0 

gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 

tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 

cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 

ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 

ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 

gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 

ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 

ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900 

ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 

ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 

tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 

ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 

ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 12 00 

gctcactgac tcgctgcgct cggccgttcg gctgcggcga gcggtatcag ctcactcaaa 1260 

ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 

aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 13 80 
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ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 

aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 

gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 

ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620 

tcgcacgata tacaggattt ■ tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680 

ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 

ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 

ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 

agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 

aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 

aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 

tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 

tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 

gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 222 0 

aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 

gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 

gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 

cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460 

cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 

cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 
gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat " 2640 

gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 

tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 

gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 

ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 

aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 

tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 

tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 

ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 
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ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc. agtgagggcg 3180 

gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 

cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 

tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 

gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 342 0 

ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 

tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 354 0 

ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 

tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 

attgggtaat. gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 372 0 

tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 

gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3840 

gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 

cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 

atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 

gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 

tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 414 0 

cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 

catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 

ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 432 0 

acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 43 80 

agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 

tggtt'tcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 

aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 

cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4 62 0 

taaatggcta aaatgagaal atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 

gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 

aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 
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gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 

gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920 

gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 

aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 

ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 

gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 

cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 

gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 

ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 

ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 

gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 

cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 

ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 

cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 

cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 

gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 

gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 

ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 

aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 

cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 

ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 

ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 

ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 63 00 

cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360 

cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 

ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480 

cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 



ggaccgtggc 


aagaaaacgt 


cccgttgcca 


ggtcctgatc 


gacgaggaaa 


ccgtcgugc u 


DDUU 


gtttgctggc 


gaccactaca 


cgaaattcat 


atgggagaag 


taccgcaagc 


cgccgccgac 




ggcccgacgg 


atgttcgact 


atttcagctc 


gcaccgggag 


ccgtacccgc 


tcaagctgga 


b / z u 


aaccttccgc 


ctcatgtgcg 


gatcggattc 


cacccgcgtg 


aagaagtggc 


gcgagcaggt 


cn on 

b / oU 


cggcgaagcc 


tgcgaagagt 


tgcgaggcag 


cggcctggtg 


gaacacgcct 


gggtcaatga 


bo4 U 


tgacctggtg 


cattgcaaac 


gctagggcct 


tgtggggtca 


gttccggctg 


ggggttcagc 


zr n n n 

by uu 


agccagcgct 


ttactggcat 


ttcaggaaca 


agcgggcact 


gctcgacgca 


cttgcttcgc 


by b u 


tcagtatcgc 


tcgggacgca 


cggcgcgctc 


tacgaactgc 


cgataaacag 


aggattaaaa 


n c\ o n 


ttgacaattg 


tgattaaggc 


tcagattcga 


cggcttggag 


cggccgacgt 


gcaggat t tc 


n n q n 
/ UoU 


cgcgagatcc. 


gattgtcggc 


cctgaagaaa 


gctccagaga 


tgttcgggtc 


cgtttacgag 


"71 / A 

/ 14U 


cacgaggaga 


aaaagcccat 


ggaggcgttc 


gctgaacggt 


tgcgagatgc 


cgtggcattc 


7200 


ggcgcctaca 


tcgacggcga 


gatcattggg 


ctgtcggtct 


tcaaacagga 


ggacggcccc 


7260 


aaggacgctc 


acaaggcgca 


tctgtccggc 


gttttcgtgg 


agcccgaaca 


gcgaggccga 


7320 


ggggtcgccg 


gtatgctgct 


gcgggcgttg 


ccggcgggtt 


tattgctcgt 


gatgatcgtc 


n-i on 

7380 


cgacagattc 


caacgggaat 


ctggtggatg 


cgcatcttca 


tcctcggcgc 


acttaatatt 


T A A n 


tcgctattct 


ggagcttgtt 


gtttatttcg 


gtctaccgcc 


tgccgggcgg 


ggtcgcggcg 


/ bUU 


acggtaggcg 


ctgtgcagcc 


gctgatggtc 


gtgttcatct 


. ctgccgctct 


gctaggtagc 


7 r /• a 

/boU 


ccgatacgat 


tgatggcggt 


cctgggggct 


atttgcggaa 


ctgcgggcgt 


ggcgctgttg 


neon 


gtgttgacac 


caaacgcagc 


gctagatcct 


gtcggcgtcg 


cagcgggcct 


ggcgggggcg 


n a o n 
/ boU 


gtttccatgg 


cgttcggaac 


cgtgctgacc 


cgcaagtggc 


aacctcccgt 


gcctctgctc 


/ /4U 


acctttaccg 


cctggcaact 


ggcggccgga 


ggacttctgc 


tcgttccagt 


agctttagtg 


/ oUU 


tttgatccgc 


caatcccgat 


gcctacagga 


accaatgttc 


tcggcctggc 


gtggctcggc 


n q c n 
/ obU 


ctgatcggag 


cgggtttaac 


ctacttcctt 


tggttccggg 


ggatctcgcg 


actcgaacct 


Toon 

/ y z u 


acagttgttt 


ccttactggg 


ctttctcagc 


cccagatctg 


gggtcgatca 


gccggggatg 


/ y ou 


catcaggccg 


acagtcggaa 


cttcgggtcc 


ccgacctgta 


ccattcggtg 


agcaatggat 


8040 


aggggagttg 


atatcgtcaa 


cgttcacttc 


taaagaaata 


gcgccactca 


gcttcctcag 


8100 


cggctttatc 


cagcgatttc 


ctattatgtc 


ggcatagttc 


tcaagatcga 


cagcctgtca 


8160 


cggttaagcg 


agaaatgaat 


aagaaggctg 


ataattcgga 


tctctgcgag 


ggagatgata 


8220 



41 



tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 8280 

tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340 

gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 

gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 8460 

tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 

gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 

tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 

gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 87 00 

agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 8760 

gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 

tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 

ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940 

ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 

gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 

gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 

aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 

ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 

tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 93 00 

tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 

agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 

tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 

tcagcaatat cacgggtagc caacgctatg tcctgatagc ggtccgccac acccagccgg 9540 

ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 

tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag cctggcgaac 9660 

agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 9720 

gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 9780 

gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 

gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 

tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960 
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cccc tgcgc t 


gacagccgga 


acacggcggc 


a Lcagaycay 




ccgattgtct 


gttgtgccca 


gtcatagccg 


aatagcctct 


ccacccaagc 


ggccggagaa 


± U ± r± u 


cctgcgtgca 


atccatcttg 


ttcaatccaa 


gctcccatgg 


gccctccjact 


agagtcgaga 


1 09 nn 


tctggattga 


gagtgaatat 


gagactctaa 


t tggataccg 


aggggaat tt 


atggaacgt c 


109^0 
1UZOU 


agtggagcat 


ttttgacaag 


aaatatttgc 


tagctgatag 


tgacct tagg 


cgacc l c uga 


10^90 


acgcgcaata 


atggtt tctg 


acgtatgtgc 


t t age teat t 


aaac tccaga 


aacccgcggc 


1 0^P0 


tgagtggctc 


cttcaacgtt 


gcggttctgt 


cagttccaaa 


egtaaaaegg 


c t tgtcccgc 


i n/i/in 

1 U4t4i U 


gtcatcggcg 


ggggtcataa 


cgtgactccc 


ttaattctcc 


gctcatgatc 


t tgatcccct 


J.UDUU 


gcgccatcag 


atccttggcg 


gcaagaaagc 


catccagttt 


actttgeagg 


gcttcccaac 


1 A (T /T n 


cttaccagag 


ggcgccccag 


ctggcaattc 


eggttcgett 


gctgtccata 


aaaccgccca 


iUbzU 


gtctagctat 


cgccatgtaa 


gcccactgca 


agctacctgc 


tttctctttg 


cgcttgcgtt 


1 A r Q f\ 

lUooU 


ttcccttgtc 


cagatagccc 


agtagctgac 


attcatcegg 


ggtcagcacc 


gtttctgegg 


1 AT yl A 


actggctttc 


tacgtgttcc 


gcttccttta 


gcagcccttg 


cgccctgagt 


gettgeggea 


1 A O A A 


gcgtgaagct 


tgcatgcctg 


caggtcgacg 


gcgcgccgag 


ctcctcgagc 


aaatttacac 


1 A Q C (\ 


attgccacta 


aacgtctaaa 


cccttgtaat 


ttgtttttgt 


tttactatgt 


gtgttatgta 


1 A Q O A 


tttgatttgc 


gataaatttt 


i_ _ 4_ .1—1—4- i_ 

tatatttggt 


actaaattta 


taacaccttt 


tatgetaacg 


T A Q Q A 


tttgccaaca 


cttagcaatt 


tgcaagttga 


ttaattgatt 


ctaaattatt 


tt tgccttcc 


1 1 a yi a 


aaatacatat 


actaatcaac 


tggaaatgta 


aatatt tget 


aatatttcta 


ctataggaga 


111UU 


attaaagtga 


gtgaatatgg 


taccacaagg 


tt tggagatt 


taattgttgc 


aatgc tgcat 


i i i en 


ggatggcata 


tacaccaaac 


~ +_ +_ ~ _ _ A- — .—.4— 

attcaataat 


tc ttgaggat 


aataatggta 


ccacacaaga 


1 1 9 9 n 
11ZZU 


tttgaggtgc 


atgaacgtca 


cgtggacaaa 


aggtt tagta 


att t t tcaag 


acaacaatgt 


i 1 9 q n 


taccacacac 


aagttttgag 


gtgcatgcat 


ggatgccctg 


tggaaagtt t 


aaaaatattt 


i i o / n 


tggaaatgat 


ttgcatggaa 


gccatgtgta 


aaaccatgac 


atccacttgg 


aggatgeaat 


11400 


aatgaagaaa 


actacaaatt 


tacatgcaac 


tagttatgea 


tgtagtctat 


ataatgagga 


11460 


ttttgcaata 


ctttcattca 


tacacactca 


ctaagtttta 


cacgattata 


atttcttcat 


11520 


agccagccca 


ccgcggtgga 


aa atg gag 


gtc gtg gag aga ttc tac ggt gag 


11572 



Met Glu Val Val Glu Arg Phe Tyr Gly Glu 
15 10 
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ttg gat ggg aag gtc teg cag ggc gtg aat gca ttg ctg ggt agt ttt 11620 
Leu Asp Gly Lys .Val Ser Gin Gly Val Asn Ala Leu Leu Gly Ser Phe 
15 20 25 

ggg gtg gag ttg acg gat acg ccc act acc aaa ggc ttg ccc etc gtt 11668 
Gly Val Glu Leu Thr Asp Thr Pro Thr Thr Lys Gly Leu Pro Leu Val 
30 35 40 

gac agt ccc aca ccc ate gtc etc ggt gtt tct gta tac ttg act att 11716 
Asp Ser Pro Thr Pro lie Val Leu Gly Val Ser Val Tyr Leu Thr lie 
45 50 55 

gtc att gga ggg ctt ttg tgg ata aag gee agg gat ctg aaa ccg cgc 11764 
Val lie Gly Gly Leu Leu Trp lie Lys Ala Arg Asp Leu Lys Pro Arg 
60 65 70 

gec teg gag cca ttt ttg etc caa get ttg gtg ctt gtg cac aac ctg 11812 
Ala Ser Glu Pro Phe Leu Leu Gin Ala. Leu Val Leu Val His Asn Leu 
75 80 85 90 

ttc tgt ttt gcg etc agt ctg tat atg tgc gtg ggc ate get tat cag 11860 
Phe Cys Phe Ala Leu Ser Leu Tyr Met Cys Val Gly lie Ala Tyr Gin 
95 100 105 

get att acc tgg egg tac tct etc tgg ggc aat gca tac aat cct aaa . 11908 
Ala lie Thr Trp Arg Tyr Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys 
110 115 120 

cat aaa gag atg gcg att ctg gta tac ttg ttc tac atg tct aag tac 11956 
His Lys Glu Met Ala lie Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr 
125 130 135 

gtg gaa ttc atg gat acc gtt ate atg ata ctg aag cgc age acc agg 12004 
Val Glu Phe Met Asp Thr Val lie Met lie Leu Lys Arg Ser Thr Arg 
140 145 150 

caa ata age ttc etc cac gtt tat cat cat tct tea att tec etc att 12052 
Gin He Ser Phe Leu His Val Tyr His His Ser Ser He Ser Leu He 
155 160 165 170 

tgg tgg get att get cat cac get cct ggc ggt gaa gca tat tgg tct 12100 
Trp Trp Ala He Ala His His Ala Pro Gly Gly Glu Ala Tyr Trp Ser 
175 180 185 

gcg get ctg aac tea gga gtg cat gtt etc atg tat gcg tat tac ttc 12148 
Ala Ala Leu Asn Ser Gly Val His Val Leu Met Tyr Ala Tyr Tyr Phe 
190 195 200 

ttg get gee tgc ctt cga agt age cca aag tta aaa aat aag tac ctt 12196 
Leu Ala Ala Cys Leu Arg Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu 
205 210 215 

ttt tgg ggc agg tac ttg aca caa ttc caa atg ttc cag ttt atg ctg 12244 
Phe Trp Gly Arg Tyr Leu Thr Gin Phe Gin Met Phe Gin Phe Met Leu 
220 225 230 

aac tta gtg cag get tac tac gac atg aaa acg aat gcg cca tat cca 12292 
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Asn Leu Val Gin Ala Tyr Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro 
235 240 245 250 

caa. tgg ctg ate aag att ttg ttc tac tac atg ate teg ttg ctg ttt 12340 
Gin Trp Leu. lie Lys lie Leu Phe Tyr Tyr Met lie Ser Leu Leu Phe 
255 260 265 

ctt ttc ggc aat ttt tac gta caa aaa tac ate aaa ccc tct gac gga 12388 
Leu Phe Gly Asn Phe Tyr Val Gin Lys Tyr lie Lys Pro Ser Asp Gly 
270 275 280 

aag caa aag gga get aaa act gag tga tctagaaggc ctcctgcttt 12435 
Lys Gin Lys Gly Ala Lys Thr Glu 
285 290 



aatgagatat 


gcgagacgcc 


tatgatcgea 


tgatatttgc 


tttcaattct 


gttgtgcacg 


12495 


ttgtaaaaaa 


cctgagcatg 


tgtagctcag 


atccttaccg 


ccggtttcgg 


ttcattctaa 


12555 


tgaatatatc 


acccgttact 


ategtatttt 


tatgaataat 


attctcegtt 


caatttactg 


12615 


attgtccgtc 


gagcaaattt 


acacattgcc 


actaaaegtc taaacccttg 


taatttgttt 


12675 


ttgttttact 


atgtgtgtta 


tgtatttgat 


ttgcgataaa 


tttttatatt 


tggtactaaa 


12735 


tttataacac 


ettttatget 


aacgtttgcc 


aacacttagc 


aatttgcaag 


ttgattaatt 


12795 


gattctaaat 


tatttttgtc 


ttctaaatac 


atatactaat 


caactggaaa 


tgtaaatatt 


12855 


tgctaatatt 


tctactatag 


gagaattaaa 


gtgagtgaat 


atggtaccac 


aaggtttgga 


12915 


■gatttaattg 


ttgeaatget 


gcatggatgg 


catatacacc 


aaacattcaa 


taattcttga 


12975 


ggataataat 


ggtaccacac 


aagatttgag 


gtgeatgaac 


gtcacgtgga 


caaaaggttt 


13035 


agtaattttt 


caagacaaca 


atgttaccac 


acacaagttt 


tgaggtgcat 


gcatggatgc 


13095 


cctgtggaaa 


gtttaaaaat 


attttggaaa 


tgatttgeat 


ggaagccatg 


tgtaaaacca 


13155 


tgacatccac 


ttggaggatg 


caataatgaa 


gaaaactaca 


aatttacatg 


caactagtta 


13215 


tgcatgtagt 


ctatataatg 


aggattttgc 


aatactttca 


ttcatacaca 


ctcactaagt 


13275 


tttacacgat 


tataatttct 


teatagecag 


eggatec atg gta ttc gcg ggc ggt 


13330 



Met Val Phe Ala Gly Gly 
295 

gga ctt cag cag ggc tct etc gaa gaa aac ate gac gtc gag cac att 1337 8 
Gly Leu Gin Gin Gly Ser Leu Glu Glu Asn lie Asp Val Glu His lie 
300 305 310 

gec agt atg tct etc ttc age gac ttc ttc agt tat gtg tct tea' act 13426 
Ala Ser Met Ser Leu Phe Ser Asp Phe Phe Ser Tyr Val Ser Ser Thr 
315 320 325 

gtt ggt teg tgg age gta cac agt ata caa cct ttg aag cgc ctg acg 13474 
Val Gly Ser Trp Ser Val His Ser lie Gin Pro Leu Lys Arg Leu Thr 
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330 335 340 

agt aag aag cgt gtt teg gaa age get gee gtg caa tgt ata tea get 13 522 
Ser Lys Lys Arg Val Ser Glu Ser Ala Ala Val Gin Cys lie Ser Ala 
345 350 355 360 

gaa gtt cag aga aat teg agt acc cag gga act gcg gag gee etc gca 13570 
Glu Val Gin Arg Asn Ser Ser Thr Gin Gly Thr Ala Glu Ala Leu Ala 
365 370 375 

gaa tea gtc gtg aag ccc acg aga cga agg tea tct cag tgg aag aag 13618 
Glu Ser Val Val Lys Pro Thr Arg Arg Arg Ser Ser Gin Trp Lys Lys 
380 385 390 

teg aca cac ccc eta tea gaa gta gca gta cac aac aag cca age gat 13666 
Ser Thr His Pro Leu Ser Glu Val Ala Val His Asn Lys Pro Ser Asp 
395 400 405 

tgctgg att gtt gta aaa aac aag gtg tat gat gtt tec aat ttt gcg 13714 
Cys Trp lie Val Val Lys Asn Lys Val Tyr Asp Val Ser Asn Phe Ala 
410 415 420 

gac gag cat ccc gga gga tea gtt att agt act tat ttt gga cga gac 13762 
Asp Glu His Pro Gly Gly Ser Val lie Ser Thr Tyr Phe Gly Arg Asp 
425 430 435 440 

ggc aca gat gtt ttc tct agt ttt cat gca get tct aca tgg aaa att 13810 
Gly Thr Asp Val Phe Ser Ser Phe His Ala Ala Ser Thr Trp Lys lie 
445 450 455 

ctt caa gac ttt tac att ggt gac gtg gag agg gtg gag ccg act cca 13858 
Leu Gin Asp Phe Tyr lie Gly Asp Val Glu Arg Val Glu Pro Thr Pro 
460 465 470 

gag ctg ctg aaa gat ttc cga gaa atg aga get ctt ttc ctg agg gag 13906 
Glu Leu Leu Lys Asp Phe Arg Glu Met Arg Ala Leu Phe Leu Arg Glu 
475 480 485 

caa ctt ttc aaa agt teg aaa ttg tac tat gtt atg aag ctg etc acg 13954 
Gin Leu Phe Lys Ser Ser Lys Leu Tyr Tyr Val Met Lys Leu Leu Thr 
490 495 . 500 

aat gtt get att ttt get gcg age att gca ata ata tgt tgg age aag 14002 
Asn Val Ala lie Phe Ala Ala Ser lie Ala lie lie Cys Trp Ser Lys 
505 510 515 520 

act att tea gcg gtt ttg get tea get tgt atg atg get ctg tgt ttc 14050 
Thr lie Ser Ala Val Leu Ala Ser Ala Cys Met Met Ala Leu Cys Phe 
525 530 535 

caa cag tgc gga tgg eta tec cat gat ttt etc cac aat cag gtg ttt 14098 
Gin Gin Cys Gly Trp Leu Ser His Asp Phe Leu His Asn Gin Val Phe 
540 . 545 550 

gag aca cgc tgg ctt aat gaa gtt gtc ggg tat gtg ate ggc aac gee 14146 
Glu Thr Arg Trp Leu Asn Glu Val Val Gly Tyr Val lie Gly Asn Ala 
555 560 565 
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gtt ctg ggg ttt agt aca ggg tgg tgg aag gag aag cat aac ctt cat 14194 
Val Leu Gly Phe Ser Thr Gly Trp Trp Lys Glu Lys .His Asn Leu His 
570 575 580 

cat get get cca aat gaa tgc gat cag act tac caa cca att gat gaa 14242 
His Ala Ala Pro Asn Glu Cys Asp Gin Thr Tyr Gin Pro lie Asp Glu 
585 590 595 600 

gat att gat act etc ccc etc att gec tgg age aag gac ata ctg gee 14290 
Asp lie Asp Thr Leu Pro Leu lie Ala Trp Ser Lys Asp lie Leu Ala 
605 610 615 

aca gtt gag aat aag aca ttc ttg cga ate etc caa tac cag cat ctg 1433 8 
Thr Val Glu Asn Lys Thr Phe Leu Arg lie Leu Gin Tyr Gin His Leu 
620 625 630 

ttc ttc atg ggt ctg tta ttt ttc gee cgt ggt agt tgg etc ttt tgg 14386 
Phe Phe Met Gly Leu Leu Phe Phe Ala Arg Gly Ser Trp Leu Phe Trp 
635 640 645 

age tgg aga tat ace tct aca gca gtg etc tea cct gtc gac agg ttg 14434 
Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu Ser Pro Val Asp Arg Leu 
650 655 660 

ttg gag aag gga act gtt ctg ttt cac tac ttt tgg ttc gtc ggg aca 14482 
Leu Glu Lys Gly Thr Val Leu Phe His Tyr Phe' Trp Phe Val Gly Thr 
665 670 675 680 

gcg tgc tat ctt etc cct ggt tgg aag cca tta gta tgg atg gcg gtg 14530 
Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro Leu Val Trp Met Ala Val 
685 690 695 

act gag etc atg tec ggc atg ctg ctg ggc ttt gta ttt gta ctt age 14578 
Thr Glu Leu Met Ser Gly Met Leu Leu Gly Phe Val Phe Val Leu Ser 
700 705 710 

cac aat ggg atg gag gtt tat aat teg tct aaa gaa ttc gtg agt gca 1462 6 
His Asn Gly Met Glu Val Tyr Asn Ser Ser Lys Glu Phe Val Ser Ala 
715 720 725 

cag ate gta tec aca egg gat' ate aaa gga aac ata ttc aac gac tgg 14674 
Gin lie Val Ser Thr Arg Asp lie Lys Gly Asn lie Phe Asn Asp Trp 
730 735 740 

ttc act ggt ggc ctt aac agg caa ata gag cat cat ctt ttc cca aca 14722 
Phe Thr Gly Gly Leu Asn Arg Gin lie Glu His His Leu Phe Pro Thr 
745 750 755 760 

atg ccc agg cat aat tta aac aaa ata gca cct aga gtg gag gtg ttc 1477 0 
Met Pro Arg His Asn Leu Asn Lys lie Ala Pro Arg Val Glu Val Phe 

765 770 775 

tgt aag aaa cac ggt ctg gtg tac gaa gac gta tct att get acc ggc 14818 
Cys Lys Lys His Gly Leu Val Tyr Glu Asp Val Ser lie Ala Thr Gly 
780 785 790 
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act tgc aag gtt ttg aaa gca ttg aag gaa gtc gcg gag get gcg gca 14866 
Thr Cys Lys Val Leu Lys Ala Leu Lys Glu Val Ala Glu Ala Ala Ala 
795 800 805 

gag cag cat get ace ace agt taa getagegtta accctgcttt aatgagatat 14920 
Glu Gin His Ala Thr Thr Ser 
810 . 815 

gcgagacgcc tatgatcgea tgatatttgc tttcaattct gttgtgcacg ttgtaaaaaa 14980 

cctgagcatg tgtagctcag atccttaccg ccggtttcgg ttcattctaa tgaatatatc 15040 

acccgttact ategtatttt tatgaataat attctcegtt caatttactg attgtccgtc 15100 

gagcaaattt acacattgcc actaaaegtc taaacccttg taatttgttt ttgttttact 15160 

atgtgtgtta tgtatttgat ttgcgataaa tttttatatt tggtactaaa tttataacac 15220 

ettttatget aacgtttgcc aacacttagc aatttgcaag ttgattaatt gattctaaat 15280 

tatttttgtc ttctaaatac atatactaat caactggaaa tgtaaatatt tgctaatatt 15340 

tctactatag gagaattaaa gtgagtgaat atggtaccac aaggtttgga gatttaattg 15400 

ttgeaatget gcatggatgg catatacacc aaacattcaa taattcttga ggataataat . 15460 

ggtaccacac aagatttgag gtgeatgaac gtcacgtgga caaaaggttt agtaattttt 15520 

caagacaaca atgttaccac acacaagttt tgaggtgcat gcatggatgc cctgtggaaa 15580 

gtttaaaaat attttggaaa tgatttgeat ggaagccatg tgtaaaacca tgacatccac 15640 

ttggaggatg caataatgaa gaaaactaca aatttacatg caactagtta tgcatgtagt 15700 

ctatataatg aggattttgc aatactttca ttcatacaca ctcactaagt tttacacgat 15760 

tataatttct teatagecag cagatctaaa atg get ccg gat gcg gat aag ctt 15814 

Met Ala Pro Asp Ala Asp Lys Leu 
820 

cga caa cgc cag acg act gcg gta gcg aag cac aat get get acc ata 15,862 
Arg Gin Arg Gin Thr Thr Ala Val Ala Lys His Asn Ala Ala Thr lie 
825 830 835 

teg acg cag gaa cgc ctt tgc agt ctg tct teg etc aaa gge gaa gaa 15910 
Ser Thr Gin Glu Arg Leu Cys Ser Leu Ser Ser Leu Lys Gly Glu Glu 
840 845 850 855 

gtc tgc ate gac gga ate ate tat gac etc caa tea ttc gat cat ccc 15958 
Val Cys lie Asp Gly lie lie Tyr Asp Leu Gin Ser Phe Asp His Pro 
860 865 870 

999 ggt gaa acg ate aaa atg ttt ggt gge aac gat gtc act gta cag 16006 
Gly Gly Glu Thr He Lys Met Phe Gly Gly Asn Asp Val Thr Val Gin 
875 880 885 



tac aag atg att cac ccg tac cat acc gag aag cat ttg gaa aag atg 



16054 
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Tyr Lys Met lie His Pro Tyr His Thr Glu Lys His Leu Glu Lys Met 
890 895 900 



aag. cgt gtc ggc .aag gtg acg gat ttc gtc tgc gag tac aag ttc gat 
Lys Arg Val. Gly Lys Val Thr Asp Phe Val Cys Glu Tyr Lys Phe Asp 
905 910 915 



16102 



acc gaa ttt gaa cgc gaa ate aaa cga gaa gtc ttc aag att gtg cga 
Thr Glu Phe Glu Arg Glu lie Lys Arg Glu Val Phe Lys lie Val Arg 
920 925 930 935 



16150 



cga ggc aag gat ttc ggt act ttg gga tgg ttc ttc cgt gcg ttt tgc 
Arg Gly Lys Asp Phe Gly Thr Leu Gly Trp Phe Phe Arg Ala Phe Cys 
940 945 950 



16198 



tac att gec att ttc ttc tac ctg cag tac cat tgg gtc acc acg gga 
Tyr lie Ala lie Phe Phe Tyr Leu Gin Tyr His Trp Val Thr Thr Gly 
955 960 965 



16246 



acc tct tgg ctg ctg gec gtg gec tac gga ate tec caa gcg atg att 
Thr Ser Trp Leu Leu Ala Val Ala Tyr Gly lie Ser Gin Ala Met lie 
970 975 980 



16294 



ggc atg aat gtc cag cac gat gec aac cac ggg gee acc tec aag cgt 
Gly Met Asn Val Gin His Asp Ala Asn His Gly Ala Thr Ser Lys Arg 
985 990 995 



16342 



ccc tgg gtc aac gac atg eta ggc etc ggt gcg gat ttt att ggt 
Pro Trp Val Asn Asp Met Leu Gly Leu Gly- Ala Asp Phe lie Gly 
1000 1005 1010 



16387 



ggt tec aag tgg etc tgg cag gaa caa cac tgg acc cac cac get 
Gly Ser Lys Trp Leu Trp Gin Glu Gin His Trp Thr His His Ala 
1015 1020 1025 



16432 



tac acc aat cac gee gag atg gat ccc gat age ttt ggt gec gaa 
Tyr . Thr Asn His Ala Glu Met Asp Pro Asp Ser Phe Gly Ala Glu 
1030 1035 1040 



16477 



cca atg etc eta ttc aac gac tat ccc ttg gat cat ccc get cgt 
Pro Met Leu Leu Phe Asn Asp Tyr Pro Leu Asp His Pro Ala Arg 
1045 1050 1055 



16522 



acc tgg eta cat cgc ttt caa gca ttc ttt tac atg ccc gtc ttg 
Thr Trp Leu His Arg Phe Gin Ala Phe Phe Tyr Met Pro Val Leu 
1060 1065 1070 



16567 



get gga tac tgg ttg tec get gtc ttc aat cca caa att ctt gac 
Ala Gly Tyr Trp Leu Ser Ala Val Phe Asn Pro Gin lie Leu Asp 
1075 1080 1085 



16612 



etc cag caa cgc ggc gca 
Leu Gin Gin Arg Gly Ala 
1090 1095 



ctt tec gtc ggt ate cgt etc gac aac 
Leu Ser Val Gly lie Arg Leu Asp Asn 
1100 



16657 



get ttc att cac teg cga cgc aag tat gcg gtt ttc tgg egg get 
Ala Phe lie His Ser Arg Arg Lys Tyr Ala Val Phe Trp Arg Ala 



16702 
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1105 1110 1115 

gtg tac att gcg gtg aac gtg att get ccg ttt tac aca aac tec 
Val Tyr lie Ala Val Asn Val lie Ala Pro Phe Tyr Thr Asn Ser 
1120 1125 1130 

ggc etc gaa tgg tec tgg cgt gtc ttt gga aac ate atg etc atg 
Gly Leu Glu Trp Ser Trp Arg Val Phe Gly Asn lie Met Leu Met 
1135 1140 1145 

ggt gtg gcg gaa teg etc gcg ctg gcg gtc ctg ttt teg ttg teg 
Gly Val Ala Glu Ser Leu Ala Leu Ala Val Leu Phe Ser Leu Ser 
1150 1155 1160 



cac aat ttc gaa tec gcg gat cgc gat ccg acc gee cca ctg aaa 
His Asn Phe Glu Ser Ala Asp Arg Asp Pro Thr Ala Pro Leu Lys 
1165 1170 1175 

aag acg gga gaa cca gtc gac tgg ttc aag aca cag gtc gaa act 
Lys Thr Gly Glu Pro Val Asp Trp Phe Lys Thr Gin Val Glu Thr 
1180 1185 1190 



tec tgc act tac ggt 
Ser Cys Thr Tyr Gly 
1195 

etc aac ttt cag gtt 
Leu Asn Phe Gin Val 
1210 

get tgg tat ccc tac 
Ala Trp Tyr Pro Tyr 
1225 

aaa cac ggc gtc cac 
Lys His Gly Val His 
1240 



gga ttc ctt tec ggt 
Gly Phe Leu Ser Gly 
1200 

gaa cac cac ttg ttc 
Glu His His Leu Phe 
1215 

att gee ccc aag gtc 
lie Ala Pro Lys Val 
1230 

tac gee tac tac ccg 
Tyr Ala Tyr Tyr .Pro 
1245 ' 



tgc ttc acg gga ggt 
Cys Phe Thr Gly Gly 
1205 

cca cgc atg age age 
Pro Arg Met Ser Ser 
1220 

cgc gaa att tgc gee 
Arg Glu lie Cys Ala 
1235 

tgg ate cac caa aac 
Trp lie His Gin Asn 
1250 



ttt etc tec acc gtc cgc tac atg cac gcg gee ggg acc ggt gee 
Phe Leu Ser Thr Val Arg Tyr Met His Ala Ala Gly Thr Gly Ala 
1255 1260 1265 

aac tgg cgc cag atg gee aga gaa aat ccc ttg acc gga egg gcg 
Asn Trp Arg Gin Met Ala Arg Glu Asn Pro Leu Thr Gly Arg Ala 
1270 1275 1280 

taa agatctgecg gcatcgatcc egggecatgg ectgetttaa tgagatatgc 

gagacgecta tgatcgcatg atatttgett tcaattctgt tgtgcacgtt gtaaaaaacc 

tgagcatgtg tagctcagat ccttaccgcc ggtttcggtt cattctaatg aatatatcac 



16747 

16792 

16837 

16882 

16927 

16972 

17017 

17062 

17107 

17152 

17197 

17250 
17310 
17370 



ccgttactat cgtattttta tgaataatat 
cgagctcggc gcgcctctag aggatcgatg 
gttgcggttc tgtcagttcc aaacgtaaaa 



tctccgttca 
aattcagatc 
cggcttgtcc 



atttactgat tgtcegtega 
ggctgagtgg ctccttcaac 
cgcgtcatcg gegggggtea 



17430 
17490 
17550 
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taacgtgact cccttaattc tccgctcatg atcagattgt cgtttcccgc . cttcagttta 17610 

aactatcagt gtttgacagg atatattggc gggtaaacct aagagaaaag agcgtttatt 17 670 

agaataatcg gatatttaaa agggcgtgaa aaggtttatc cttcgtccat ttgtatgtgc 17730 

atgccaacca cagggttccc ca 17 7 52 



<210>. 20 

<211> 290 

<212> PRT 

<213> Phaeodactylum tricornutum, Physcomitrella patens 

<400> 20 

Met Glu Val Val Glu Arg Phe Tyr Gly Glu Leu Asp Gly Lys Val Ser 
1 5 10 15 



Gin Gly Val Asn Ala Leu Leu Gly Ser Phe Gly Val Glu Leu Thr Asp 
20 25 .30 



Thr Pro Thr Thr Lys Gly Leu Pro Leu Val Asp Ser Pro Thr Pro lie 
35 40 • 45 



Val Leu Gly Val Ser Val Tyr Leu Thr He Val He Gly Gly Leu Leu 
50 .55 60 



Trp He Lys Ala Arg Asp Leu Lys Pro Arg Ala Ser Glu Pro Phe Leu 
65 70 75 80 



Leu Gin Ala Leu Val Leu Val His Asn Leu Phe Cys Phe Ala Leu Ser 
85 90 95 



Leu Tyr Met Cys Val Gly He Ala Tyr Gin Ala He Thr Trp Arg Tyr 
100 105 110 



Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys His Lys Glu Met Ala He 
115 120 125 



Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr Val Glu Phe Met Asp Thr 
130 135 140 



Val He Met He Leu Lys Arg Ser Thr Arg Gin He Ser Phe Leu His 
145 150 155 160 
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Val Tyr His His Ser Ser lie Ser Leu lie Trp Trp Ala lie Ala His 
165 170 175 



His Ala Pro Gly Gly Glu Ala Tyr Trp Ser Ala Ala Leu Asn Ser Gly 
180 185 190 



Val His Val Leu Met Tyr Ala Tyr Tyr Phe Leu Ala Ala Cys Leu Arg 
195 200 205 



Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu Phe Trp Gly Arg Tyr Leu 
210 215 220 



Thr Gin Phe Gin Met Phe Gin Phe Met Leu Asn Leu Val Gin Ala Tyr 
225 230 235 240 



Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro Gin Trp Leu lie Lys lie 
245 250 255 



Leu Phe Tyr Tyr Met lie Ser Leu Leu Phe Leu Phe Gly Asn Phe Tyr 
260 265 270 



Val Gin Lys Tyr lie Lys Pro Ser Asp Gly Lys Gin Lys Gly Ala Lys 
275 280 285 



Thr Glu 
290 



<210> 21 

<211> 525 

<212> PRT 

<213> Phaeodactylum tricornutum, Physcomitrella patens 

<400> 21 

Met Val Phe Ala Gly Gly Gly Leu Gin Gin Gly Ser Leu Glu Glu Asn 
1.5 10 15 



lie Asp Val Glu His lie Ala Ser Met Ser Leu Phe Ser Asp Phe Phe 
20 25 30 



Ser Tyr Val Ser Ser Thr Val Gly Ser Trp Ser Val His Ser He Gin 
35 40 45 



Pro Leu Lys Arg Leu Thr Ser Lys Lys Arg Val Ser Glu Ser Ala Ala 
50 55 60 
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Val Gin Cys lie Ser Ala Glu Val Gin Arg Asn ,Ser .Ser Thr Gin Gly 
65 70 75 80 



Thr .Ala Glu Ala Leu Ala Glu Ser Val Val Lys Pro Thr Arg Arg Arg 
85 90 95 



Ser Ser Gin Trp Lys Lys Ser Thr His Pro Leu Ser Glu Val Ala Val 
100 105 110 



His Asn Lys Pro Ser Asp Cys Trp lie Val Val Lys Asn Lys Val Tyr 
115 120 125 



Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly Gly Ser Val He Ser 
130 135 140 



Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe Ser Ser Phe His Ala 
145 ■ 150 155 160 



Ala Ser Thr Trp Lys He Leu Gin Asp Phe Tyr- He Gly Asp Val Glu 
165 170 175 



Arg Val Glu Pro Thr Pro. Glu Leu Leu Lys Asp Phe Arg Glu Met Arg 
180 185 190 



Ala Leu Phe Leu Arg Glu Gin Leu Phe Lys Ser Ser Lys Leu Tyr Tyr 
195 ' 200 205 



Val Met Lys Leu Leu Thr Asn Val Ala He Phe Ala Ala Ser He Ala 
210 215 . 220 



He He Cys Trp Ser Lys Thr He Ser Ala Val Leu Ala Ser Ala Cys 
225 230 235 240 



Met Met Ala Leu Cys Phe Gin Gin Cys Gly Trp Leu Ser His Asp Phe 
245 250 255 



Leu His Asn Gin Val Phe Glu Thr Arg Trp Leu Asn Glu Val Val Gly 

260 • 265 270 



Tyr Val He Gly Asn Ala 
275 



Val Leu Gly Phe 
280 



Ser Thr Gly Trp 
285 



Trp Lys 
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Glu Lys His Asn Leu His His Ala Ala Pro Asn Glu Cys Asp Gin Thr 
290 295 300 



Tyr Gin Pro lie Asp Glu Asp lie Asp Thr Leu Pro Leu lie Ala Trp 
305 310 315 320 



Ser Lys Asp lie Leu Ala Thr Val Glu Asn Lys Thr Phe Leu Arg lie 
325 330 335 



Leu Gin Tyr Gin His Leu Phe Phe Met Gly Leu Leu Phe Phe Ala Arg 
340 345 350 



Gly Ser Trp Leu Phe Trp Ser Trp Arg. Tyr Thr Ser Thr Ala Val Leu 
355 360 365 



Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr Val Leu Phe His Tyr 
370 375 380 



Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro 
385 390 395 400 



Leu Val Trp Met Ala Val Thr Glu Leu Met Ser Gly Met Leu Leu Gly 
405 410 415 



Phe Val Phe Val Leu Ser His Asn Gly Met Glu Val Tyr Asn Ser Ser 
420 425 430 



Lys Glu Phe Val Ser Ala Gin lie Val Ser Thr Arg Asp lie Lys Gly 
435 440 445 



Asn lie Phe Asn Asp Trp Phe Thr Gly Gly Leu Asn Arg Gin lie Glu 
450 455 460 



His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Lys lie Ala 
465 470 475 480 



Pro Arg Val Glu Val Phe Cys Lys Lys His Gly Leu Val Tyr Glu Asp 
485 490 495 



Val Ser lie Ala Thr Gly Thr Cys Lys Val Leu Lys Ala Leu Lys Glu 
500 505 510 
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Val Ala Glu Ala Ala Ala Glu Gin His Ala Thr Thr Ser 
515 520 525 



<210> 22 
<211> 469 
<212> PRT 

<213> Phaeodactylum tricornutum, Physcomitrella patens 
<400> 22 

Met Ala Pro Asp Ala Asp Lys Leu Arg Gin Arg Gin Thr Thr Ala Val 
15 10 15 



Ala Lys His Asn Ala Ala Thr lie Ser Thr Gin Glu Arg Leu Cys Ser 
20 25 30 



Leu Ser Ser Leu Lys Gly Glu Glu Val Cys lie Asp Gly lie lie Tyr 
35 40 45 



Asp Leu Gin Ser Phe Asp His Pro Gly Gly Glu Thr lie Lys Met Phe 
50 55 60 



Gly Gly Asn Asp Val Thr Val Gin Tyr Lys Met lie His Pro Tyr His 
65 70 75 80 



Thr Glu Lys His Leu Glu Lys Met Lys Arg Val Gly Lys Val Thr Asp 
85 90 95 



Phe Val Cys Glu Tyr Lys Phe Asp Thr Glu Phe Glu Arg Glu lie Lys 
100 105 110 



Arg Glu Val Phe Lys lie Val Arg Arg Gly Lys Asp Phe Gly Thr Leu 
115 .120 125 



Gly Trp Phe Phe Arg Ala Phe Cys Tyr lie Ala lie Phe Phe Tyr Leu 
130 135 140 



Gin Tyr His Trp Val Thr Thr Gly Thr Ser Trp Leu Leu Ala Val Ala 
145 150 155 160 



Tyr Gly lie Ser Gin Ala Met He Gly Met Asn Val Gin His Asp Ala 
165 170 175 



Asn His Gly Ala Thr Ser Lys Arg Pro Trp Val Asn Asp Met Leu Gly 
180 185 190 
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Leu Gly Ala Asp Phe lie Gly Gly 
195 200 



His Trp Thr His His Ala Tyr Thr 
210 215 



Ser Phe Gly Ala Glu Pro Met Leu 
225 230 



Ser Lys Trp Leu Trp Gin Glu Gin 
205 



Asn His Ala Glu Met Asp Pro Asp 
220 



Leu Phe Asn Asp Tyr Pro. Leu Asp 
235 240 



His Pro Ala Arg Thr Trp Leu His Arg Phe Gin Ala Phe Phe Tyr Met 
245 250 255 



Pro Val Leu Ala Gly Tyr Trp Leu Ser Ala Val Phe Asn Pro Gin lie 
260 265 270 



Leu Asp Leu Gin Gin Arg Gly Ala Leu Ser Val Gly lie Arg Leu Asp 
275 280 285 



Asn Ala Phe lie His Ser Arg Arg Lys Tyr Ala Val Phe Trp Arg Ala 
290 295 300 



Val Tyr lie Ala Val Asn Val lie Ala Pro Phe Tyr Thr Asn Ser Gly 
305 310 315 320 



Leu Glu Trp Ser Trp Arg Val Phe Gly Asn lie Met Leu Met Gly Val 
325 33.0 335 



Ala Glu Ser Leu Ala Leu Ala Val Leu Phe Ser Leu Ser His Asn Phe 
340 345 350 



Glu Ser Ala Asp Arg Asp Pro Thr Ala Pro Leu Lys Lys Thr Gly Glu 
355 360 365 



Pro Val Asp Trp Phe Lys Thr Gin Val Glu Thr Ser Cys Thr Tyr Gly 
370 375 380 



Gly Phe Leu Ser Gly Cys Phe Thr Gly Gly Leu Asn Phe Gin Val Glu 
385 '390 395 400 



His His Leu Phe Pro Arg Met Ser Ser Ala Trp Tyr Pro Tyr lie Ala 
405 410 415 
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Pro Lys Val 



Arg Glu lie Cys Ala Lys His Gly Val His Tyr Ala Tyr 
420 425 430 



Tyr Pro Trp 
435 



lie His Gin Asn Phe Leu Ser Thr Val Arg Tyr Met His 
440 445 



Ala Ala Gly 
450 



Thr Gly Ala Asn Trp Arg Gin Met Ala Arg Glu Asn Pro 
455 460 



Leu Thr Gly Arg Ala 
465 



<210> 23 

<211> 26 

<212> DNA 

<213> artificial sequence 
<220> 

<221> misc_f eature 

<222> (1) . . (26) 

<223> polylinker sequence 



<400> 23 

gaattcggcg cgccgagctc ctcgag 26 



<210> 24 

<211> 265 

<212> DNA 

<213> artificial sequence 
<220> 

<221> misc_f eature 

<222> (1)..(265) 

<223> polyl inker- terminator-polylinkers sequence 



<400> 24 

ccaccgcggt gggcggccgc ctgcagtcta gaaggcctcc tgctttaatg agatatgcga 



60 



gacgcctatg atcgcatgat atttgctttc aattctgttg tgcacgttgt aaaaaacctg 



120 



agcatgtgta gctcagatcc ttaccgccgg tttcggttca ttctaatgaa tatatcaccc 



180 



gttactatcg tatttttatg aataatattc tccgttcaat ttactgattg tccgtcgacg 



240 



aattcgagct cggcgcgcca agctt 



265 



<210> 25 
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<211> 257 

<212> DNA 

<213> artificial sequence 
<220> 

<221> misc_f eature 

<222> (1)..(257) 

<223> polylinker-terminator-polyl inkers sequence 



<400> 25 

ggatccgata tcgggcccgc tagcgttaac cctgctttaa tgagatatgc gagacgccta 60 

tgatcgcatg atatttgctt tcaattctgt tgtgcacgtt gtaaaaaacc tgagcatgtg 120 

tagctcagat ccttaccgcc ggtttcggtt cattctaatg aatatatcac ccgttactat 180 

cgtattttta tgaataatat tctccgttca atttactgat tgtccgtcga cgaattcgag 240 

ctcggcgcgc caagctt 257 

<210> 26 

<211> 5410 

<212> DNA 

<213> artificial sequence 
<220> 

<221> misc_f eature 

<222> (1)..(5410) 

<223> plant expression vector with one promoter-terminator expression cassette 



<400>. 26 
ttttggaaat 


gatttgcatg 


gaagccatgt 


gtaaaaccat 


gacatccact 


tggaggatgc 


60 


aataatgaag 


aaaactacaa 


atttacatgc 


aactagttat 


gcatgtagtc 


tatataatga 


120 


ggattttgca 


atactttcat 


tcatacacac 


tcactaagtt 


ttacacgatt 


ataatttctt 


180 


catagccagc 


ggatccgata 


tcgggcccgc 


tagcgttaac 


cctgctttaa 


tgagatatgc 


240 


gagacgccta 


tgatcgcatg 


atatttgctt 


tcaattctgt 


tgtgcacgtt 


gtaaaaaacc 


300 


tgagcatgtg 


tagctcagat 


ccttaccgcc 


ggtttcggtt 


cattctaatg 


aatatatcac 


360 


ccgttactat 


cgtattttta 


tgaataatat 


tctccgttca 


atttactgat 


tgtccgtcga 


420 


gcaaatttac 


acattgccac 


taaacgtcta 


aacccttgta 


atttgttttt 


gttttactat 


480 


gtgtgttatg 


tatttgattt 


gcgataaatt 


tttatatttg 


gtactaaatt 


tataacacct 


540 


tttatgctaa 


cgtttgccaa 


cacttagcaa 


tttgcaagtt 


gattaattga 


ttctaaatta 


600 


tttttgtctt 


ctaaatacat 


atactaatca 


actggaaatg 


taaatatttg 


ctaatatttc 


660 


tactatagga 


gaattaaagt 


gagtgaatat 


ggtaccacaa 


ggtttggaga 


tttaattgtt 


720 
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gcaatgctgc atggatggca tatacaccaa acattcaata attcttgagg ataataatgg 7 80 

taccacacaa gatttgaggt gcatgaacgt cacgtggaca aaaggtttag taatttttca 840 

agacaacaat gttaccacac acaagttttg aggtgcatgc atggatgccc tgtggaaagt 900 

ttaaaaatat tttggaaatg atttgcatgg aagccatgtg taaaaccatg acatccactt 960 

ggaggatgca ataatgaaga aaactacaaa tttacatgca actagttatg catgtagtct 1020 

atataatgag gattttgcaa tactttcatt catacacact cactaagttt tacacgatta 1080 

taatttcttc atagccagca gatctgccgg catcgatccc gggccatggc ctgctttaat 1140 

gagatatgcg agacgcctat gatcgcatga tatttgcttt caattctgtt gtgcacgttg 1200 

taaaaaacct. gagcatgtgt agctcagatc cttaccgccg gtttcggttc attctaatga 1260 

atatatcacc cgttactatc gtatttttat gaataatatt ctccgttcaa tttactgatt 1320 

gtccgtcgac . gagctcggcg cgccaagctt ggcgtaatca tggtcatagc tgtttcctgt 1380 

gtgaaattgt tatccgctca caattccaca caacatacga gccggaagca taaagtgtaa 1440 

agcctggggt gcctaatgag tgagctaact cacattaatt gcgttgcgct cactgcccgc 1500 

tttccagtcg ggaaacctgt cgtgccagct gcattaatga atcggccaac gcgcggggag 1560 

aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt 162 0 

cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga 1680 

atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg 1740 

taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa 1800 

aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt 1860 

tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct 1920 

gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct 1980 

cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc 2040 

cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt 2100 

atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc 2160 

cacagagttc ttgaagtggt • ggcctaacta cggctacact agaaggacag tatttggtat 2220 

ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa 2280 

acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa 2340 
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aaaaggatct 


caagaagatc 


ctttgatctt 


ttctacgggg 


tctgacgctc 


agtggaacga 


2400 


aaactcacgt 


taagggattt 


tggtcatgag 


attatcaaaa 


aggatcttca 


cctagatcct 


2460 


tttaaattaa 


aaatgaagtt 


ttaaatcaat 


ctaaagtata 


tatgagtaaa 


cttggtctga 


2520 


cagttaccaa 


tgcttaatca 


gtgaggcacc 


tatctcagcg 


atctgtctat 


ttcgttcatc 


2580 


catagttgcc 


tgactccccg 


tcgtgtagat 


aactacgata 


cgggagggct 


taccatctgg 


2640 


ccccagtgct 


gcaatgatac 


cgcgagaccc 


acgctcaccg 


gctccagatt 


.tatcagcaat 


2700 


aaaccagcca 


gccggaaggg 


ccgagcgcag 


aagtggtcct 


gcaactttat 


ccgcctccat 


2760 


ccagtctatt 


aattgttgcc 


gggaagctag 


agtaagtagt 


tcgccagtta 


atagtttgcg 


2820 


caacgttgtt 


gccattgcta 


caggcatcgt 


ggtgtcacgc 


tcgtcgtttg 


gtatggcttc 


2880 


attcagctcc 


ggttcccaac 


gatcaaggcg 


agttacatga 


tcccccatgt 


tgtgcaaaaa 


2940 


agcggttagc 


tccttcggtc 


ctccgatcgt 


tgtcagaagt 


aagttggccg 


cagtgttatc 


3000 


actcatggtt 


atggcagcac 


tgcataattc 


tcttactgtc 


atgccatccg 


taagatgctt 


3060 


ttctgtgact 


ggtgagtact 


caaccaagtc 


attctgagaa 


tagtgtatgc 


ggcgaccgag 


3120 


ttgctcttgc 


ccggcgtcaa 


tacgggataa 


taccgcgcca 


catagcagaa 


ctttaaaagt 


3180 


g.ctcatcatt 


ggaaaacgtt 


cttcggggcg 


aaaactctca 


aggatcttac 


cgctgttgag 


3240 


atccagttcg 


atgtaaccca 


ctcgtgcacc 


caactgatct 


tcagcatctt 


ttactttcac 


3300 


cagcgtttct 


gggtgagcaa 


aaacaggaag 


gcaaaatgcc 


gcaaaaaagg 


gaataagggc 


3360 


gacacggaaa 


tgttgaatac 


tcatactctt 


cctttttcaa 


tattattgaa 


gcatttatca 


3420 


gggttattgt 


ctcatgagcg 


gatacatatt 


tgaatgtatt 


tagaaaaata 


aacaaatagg 


3480 


ggttccgcgc 


acatttcccc 


gaaaagtgcc 


acctgacgtc 


taagaaacca 


ttattatcat 


3540 


gacattaacc 


tataaaaata 


ggcgtatcac 


gaggcccttt 


cgtctcgcgc 


gtttcggtga 


3600 


tgacggtgaa 


aacctctgac 


acatgcagct 


cccggagacg 


gtcacagctt 


gtctgtaagc 


3660 


ggatgccggg 


agcagacaag 


cccgtcaggg 


cgcgtcagcg 


ggtgttggcg 


ggtgtcgggg 


3720 


ctggcttaac 


tatgcggcat 


cagagcagat 


tgtactgaga 


gtgcaccata 


tgcggtgtga 


3780 


aataccgcac 


agatgcgtaa 


ggagaaaata 


ccgcatcagg 


cgccattcgc 


cattcaggct 


3840 


gcgcaactgt 


tgggaagggc 


gatcggtgcg 


ggcctcttcg 


ctattacgcc 


agctggcgaa 


3900 


agggggatgt 


gctgcaaggc 


gattaagttg 


ggtaacgcca 


gggttttccc 


agtcacgacg 


3960 


ttgtaaaacg 


acggccagtg 


aattcggcgc 


gccgagctcc 


tcgagcaaat 


ttacacattg 


4020 


ccactaaacg 


tctaaaccct 


tgtaatttgt 


ttttgtttta 


ctatgtgtgt 


tatgtatttg 


4080 
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atttgcgata 


aatttttata 


tttggtacta 


aatt tataac 


accttttatg . 


ctaacgtttg 


ZL1 AO 


ccaacactta 


gcaat.ttgca 


agttgattaa 


ttgattctaa 


attatttttg 


tcttctaaat 


4tZ U U 


acatatacta 


atcaactgga 


aatgtaaata 


tttgetaata 


tttctactat 


aggagaatta 


a o & n 


aagtgagtga 


atatggtacc 


acaaggtttg 


gagatttaat 


tgttgcaatg ctgcatggat 


a i o n 


ggcatataca 


ccaaacattc 


aataattctt 


gaggataata 


atggtaccac 


acaagatttg 


/i *i on 
4j O U 


aggtgcatga acgtcacgtg 


gacaaaaggt 


ttagtaattt 


ttcaagacaa 


caatgttacc 


4440 


acacacaagt 


tttgaggtgc 


atgcatggat 


gccctgtgga 


aagtttaaaa 


atattttgga 


4500 


aatgatttgc 


atggaagcca 


tgtgtaaaac 


catgacatcc 


acttggagga 


tgcaataatg 


4560 


aagaaaacta 


caaatttaca 


tgcaactagt 


tatgcatgta 


gtctatataa 


tgaggatttt 


4620 


fTPR^tafttt 


cat teat aca 


cactcactaa 


gttttacacg 


attataattt 


cttcatagcc 


a c o n 
4 doU 


Pi CTC'CCFlC'C'CIC 


yy ^yyy^yy*- 


cgcctgcagt 


etagaaggee 


tectgettta 


ataacratatcr 


4 /4U 


pcracracacct 

v-* ^3 cxy t-4. v^- 


atgategcat 


gatatttget 


ttcaattctg 


ttgtgcacgt 


tgtaaaaaac 


/l o c\ c\ 


ctgagcatgt 


gtagctcaga 


tccttaccgc 


cggtttcggt 


1 tcattctaat 


gaatatatca 


/I Q C C\ 


cccgttacta 


tegtattttt 


atgaataata 


ttctccgttc 


aatttactga 


ttgtccgtcg 


yi OTA 


agcaaattta 


cacattgcca 


etaaaegtet 


aaacccttgt 


aatttgtttt 


tgttttacta 


a n o r\ 

4980 


tgtgtgttat 


gtatttgatt 


tgcgataaat 


ttttatattt 


ggtactaaat 


ttataacacc 


rfi/l n 


ttttatgcta 


aegtttgeca 


acacttagca 


atttgeaagt 


tgattaattg 


attctaaatt 


blUU 


atttttgtct 


tctaaataca 


tatactaatc 


aactggaaat 


gtaaatattt 


gctaatattt 


jibu 


ctactatagg 


agaattaaag 


tgagtgaata 


tggtaccaca aggtttggag 


atttaattgt 


5220 


tgcaatgctg 


catggatggc 


atatacacca 


aacattcaat 


aattcttgag 


gataataatg 


5280 


gtaccacaca 


agatttgagg 


tgcatgaacg 


tcacgtggac 


aaaaggttta 


gtaatttttc 


5340 


aagacaacaa 


tgttaccaca 


cacaagtttt 


gaggtgcatg 


catggatgee 


ctgtggaaag 


5400 



tttaaaaata 5410 



<210> 27 

<211> 12093 

<212> DNA 

<213> artificial sequence 



<220> 

<221> misc_f eature 
<222> (1) . . (12093) 
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<223> plant expression vector with one promoter- terminator expression cassette 
<400> 27 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 
gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 
tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 
ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240 
atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 3 00 
ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 3 60 
tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 
gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 
tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 
cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 
ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 
ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 
gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 
ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 
ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900 
ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 

ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 102 0 

tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 

ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 

ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 

gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 12 60 

ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 

aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 13 80 

ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 

aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 

gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 

ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620 
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tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg. tgtatccaac 1680 

ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 

ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 

ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 

agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 

aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 

aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 

tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 

tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 

gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 

aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 

gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 

gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 

cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460 

cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 

cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 

gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640 

gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 

tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 

gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 

ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 

aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 

tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 

tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 

ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 

ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 

gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 

cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 3300 
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tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 

gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420 

ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 

tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 

ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 

tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 

attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 

tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 

gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3 840 

gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 

cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 

atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 4020 

gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 

tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 

cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 

catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 

ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 

acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 43 80 

agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 

tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 

aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 

cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620 

taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 

gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 

aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 

gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 

gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920 

gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 

aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 
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ttagccgaat 


tggattactt 


actgaataac 


gatctggccg 


atgtggattg. 


cgaaaactgg 


5100 


gaagaagaca 


ctccatttaa 


agatccgcgc 


gagctgtatg 


attttttaaa 


gacggaaaag 


5160 


cccgaagagg 


aacttgtctt 


ttcccacggc 


gacctgggag 


acagcaacat 


ctttgtgaaa 


5220 


gatggcaaag 


taagtggctt 


tattgatctt 


gggagaagcg 


gcagggcgga 


caagtggtat 


5280 


gacattgcct 


tctgcgtccg 


gtcgatcagg 


gaggatatcg 


gggaagaaca 


gtatgtcgag 


5340 


ctattttttg 


acttactggg 


gatcaagcct 


gattgggaga 


aaataaaata 


ttatatttta 


5400 


ctggatgaat 


tgttttagta 


cctagatgtg 


gcgcaacgat 


gccggcgaca 


agcaggagcg 


5460 


caccgacttc 


ttccgcatca 


agtgttttgg 


ctctcaggcc 


gaggcccacg 


gcaagtattt 


5520 


gggcaagggg 


tcgctggtat 


tcgtgcaggg 


caagattcgg 


aataccaagt 


acgagaagga 


5580 


cggccagacg. 


gtctacggga 


ccgacttcat 


tgccgataag 


gtggattatc 


tggacaccaa 


5640 


ggcaccaggc 


gggtcaaatc 


aggaataagg 


gcacattgcc 


ccggcgtgag 


tcggggcaat 


5700 


cccgcaagga 


gggtgaatga 


atcggacgtt 


tgaccggaag 


gcatacaggc 


aagaactgat 


5760 


cgacgcgggg 


ttttccgccg 


aggatgccga 


aaccatcgca 


agccgcaccg 


tcatgcgtgc 


5820 


gccccgcgaa 


accttccagt 


ccgtcggctc 


gatggtccag 


caagctacgg 


ccaagatcga 


5880 


gcgcgacagc 


gtgcaactgg 


ctccccctgc 


cctgcccgcg 


ccatcggccg 


ccgtggagcg 


5940 


ttcgcgtcgt 


ctcgaacagg 


aggcggcagg 


tttggcgaag 


tcgatgacca 


tcgacacgcg 


6000 


aggaactatg 


acgaccaaga 


agcgaaaaac 


cgccggcgag 


gacctggcaa 


aacaggtcag 


6060 


cgaggccaag 


caggccgcgt 


tgctgaaaca 


cacgaagcag 


cagatcaagg 


aaatgcagct 


6120 


ttccttgttc 


gatattgcgc 


cgtggccgga 


cacgatgcga 


gcgatgccaa 


acgacacggc 


6180 


ccgctctgcc 


ctgttcacca 


cgcgcaacaa 


gaaaatcccg 


cgcgaggcgc 


tgcaaaacaa 


6240 


ggtcattttc 


cacgtcaaca 


aggacgtgaa 


gatcacctac 


accggcgtcg 


agctgcgggc 


6300 


cgacgatgac 


gaactggtgt 


ggcagcaggt 


gttggagtac 


gcgaagcgca 


cccctatcgg 


6360 


cgagccgatc 


accttcacgt 


tctacgagct 


ttgccaggac 


ctgggctggt 


cgatcaatgg 


6420 


ccggtattac 


acgaaggccg 


aggaatgcct 


gtcgcgccta 


caggcgacgg 


cgatgggctt 


6480 


cacgtccgac 


cgcgttgggc 


acctggaatc 


ggtgtcgctg 


ctgcaccgct 


tccgcgtcct 


6540 


ggaccgtggc 


aagaaaacgt 


cccgttgcca 


ggtcctgatc 


gacgaggaaa 


tcgtcgtgct 


6600 


gtttgctggc 


gaccactaca 


cgaaattcat 


atgggagaag 


taccgcaagc 


tgtcgccgac 


6660 


ggcccgacgg 


atgttcgact 


atttcagctc 


gcaccgggag 


ccgtacccgc 


tcaagctgga 


6720 
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aaccttccgc 


ctcatgtgcg 


gatcggattc 


cacccgcgtg 


aagaagtggc 


gcgagcaggt 


6780 


cggcgaagcc 


tgcgaagagt 


tgcgaggcag 


cggcctggtg 


gaacacgcct 


gggtcaatga 


6840 


tgacctggtg 


cattgcaaac 


gctagggcct 


tgtggggtca 


gttccggctg 


ggggttcagc 


6900 


agccagcgct 


ttactggcat 


ttcaggaaca 


agcgggcact 


gctcgacgca 


cttgcttcgc 


6960 


tcagtatcgc 


tcgggacgca 


cggcgcgctc 


tacgaactgc 


cgataaacag 


aggattaaaa 


7020 


ttgacaattg 


tgattaaggc 


tcagattcga 


cggcttggag 


cggccgacgt 


gcaggatttc 


7080 


cgcgagatcc 


gattgtcggc 


cctgaagaaa 


gctccagaga 


tgttcgggtc 


cgtttacgag 


7140 


cacgaggaga 


aaaagcccat 


ggaggcgttc 


gctgaacggt 


tgcgagatgc 


cgtggcattc 


7200 


ggcgcctaca 


tcgacggcga 


gatcattggg 


ctgtcggtct 


tcaaacagga 


ggacggcccc 


7260 


aaggacgctc 


acaaggcgca 


tctgtccggc 


gttttcgtgg 


agcccgaaca 


gcgaggccga 


7320 


ggggtcgccg 


gtatgctgct 


gcgggcgttg 


ccggcgggtt 


tattgctcgt 


gatgatcgtc 


7380 


cgacagattc 


caacgggaat 


ctggtggatg 


cgcatcttca 


tcctcggcgc 


acttaatatt 


7440 


tcgctattct 


ggagcttgtt 


gtttatttcg 


gtctaccgcc 


tgccgggcgg 


ggtcgcggcg 


7500 


acggtaggcg 


ctgtgcagcc 


gctgatggtc 


gtgttcatct 


ctgccgctct 


gctaggtagc 


7560 


ccgatacgat 


tgatggcggt 


cctgggggct 


atttgcggaa 


ctgcgggcgt 


ggcgctgttg 


7620 


gtgttgacac 


caaacgcagc 


gctagatcct 


gtcggcgtcg 


cagcgggcct 


ggcgggggcg 


7680 


gtttccatgg 


cgttcggaac 


cgtgctgacc 


cgcaagtggc 


aacctcccgt 


gcctctgctc 


7740 


acctttaccg 


cctggcaact 


ggcggccgga 


ggacttctgc 


tcgttccagt 


agctttagtg 


7800 


tttgatccgc 


caatcccgat 


gcctacagga 


accaatgttc 


tcggcctggc 


gtggctcggc 


7860 


ctgatcggag 


cgggtttaac 


ctacttcctt 


tggttccggg 


ggatctcgcg 


actcgaacct 


7920 


acagttgttt 


ccttactggg 


ctttctcagc 


cccagatctg 


gggtcgatca 


gccggggatg 


7980 


catcaggccg 


acagtcggaa 


cttcgggtcc 


ccgacctgta 


ccattcggtg 


agcaatggat 


8040 


aggggagttg 


atatcgtcaa 


cgttcacttc 


taaagaaata 


gcgccactca 


gcttcctcag 


8100 


cggctttatc 


cagcgatttc 


ctattatgtc 


ggcatagttc 


tcaagatcga 


cagcctgtca 


8160 


cggttaagcg 


agaaatgaat 


aagaaggctg 


ataattcgga 


tctctgcgag 


ggagatgata 


8220 


tttgatcaca 


ggcagcaacg 


ctctgtcatc 


gttacaatca 


acatgctacc 


ctccgcgaga 


8280 


tcatccgtgt 


ttcaaacccg 


gcagcttagt 


tgccgttctt 


ccgaatagca 


tcggtaacat 


8340 


gagcaaagtc 


tgccgcctta 


caacggctct 


cccgctgacg 


ccgtcccgga 


ctgatgggct 


8400 


gcctgtatcg 


agtggtgatt 


ttgtgccgag 


ctgccggtcg 


gggagctgtt 


ggctggctgg 


8460 
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tggcaggata tattgtggtg taaacaaatt 
gacgttttta atgtactggg gtggtttttc 
tgcccttcac cgcctggccc tgagagagtt 
gcaggcgaaa atcctgtttg atggtggttc 
agaatagccc gagatagggt tgagtgttgt 
gaacgtggac tccaacgtca aagggcgaaa 
tgaaccatca cccaaatcaa gttttttggg 
ccctaaaggg agcccccgat ttagagcttg 
ggaagggaag aaagcgaaag gagcgggcgc 
gatcggtgcg. ggcctcttcg ctattacgcc 
gattaagttg ggtaacgcca gggttttccc 
aattaattcc catcttgaaa gaaatatagt 
ggtattatag tccaagcaaa aacataaatt 
tcaataactg attatatcag ctggtacatt 
tgtgtaatac ataaattgat gatatagcta 
agcttgggtc ccgctcagaa gaactcgtca 
tcgggagcgg cgataccgta aagcacgagg 
tcagcaatat cacgggtagc caacgctatg 
ccacagtcga tgaatccaga aaagcggcca 
tcgccatggg tcacgacgag atcctcgccg 
agttcggctg gcgcgagccc ctgatgctct 
gcttccatcc gagtacgtgc tcgctcgatg 
gtagccggat caagcgtatg cagccgccgc 
gcaggagcaa ggtgagatga caggagatcc 
tcccttcccg cttcagtgac aacgtcgagc 
agccacgata gccgcgctgc cccgtcctgc 
ttgacaaaaa gaaccgggcg cccctgcgct 
ccgattgtct gttgtgccca gtcatagccg 



gacgcttaga caacttaata. acacattgcg 8520 

ttttcaccag tgagacgggc aacagctgat 8580 

gcagcaagcg gtccacgctg gtttgcccca 8640 

cgaaatcggc aaaatccctt ataaatcaaa 8700 

tccagtttgg aacaagagtc cactattaaa 8760 

aaccgtctat cagggcgatg gcccactacg 8820 

gtcgaggtgc cgtaaagcac taaatcggaa 8880 

acggggaaag ccggcgaacg tggcgagaaa 8940 

cattcaggct gcgcaactgt tgggaagggc 9000 

agctggcgaa agggggatgt gctgcaaggc 9060 

agtcacgacg ttgtaaaacg acggccagtg 9120 

ttaaatattt attgataaaa taacaagtca 9180 

tattgatgca agtttaaatt cagaaatatt 9240 

gccgtagatg aaagactgag tgcgatatta 9300 

gcttagctca tcgggggatc cgtcgaagct 9360 

agaaggcgat agaaggcgat gcgctgcgaa 942 0 

aagcggtcag cccattcgcc gccaagctct 9480 

tcctgatagc ggtccgccac acccagccgg 9540 

ttttccacca tgatattcgg caagcaggca 9600 

tcgggcatgc gcgccttgag cctggcgaac 9660 

tcgtccagat catcctgatc gacaagaccg 972 0 

cgatgtttcg cttggtggtc gaatgggcag 9780 

attgcatcag ccatgatgga tactttctcg 9840 

tgccccggca cttcgcccaa tagcagccag 9900 

acagctgcgc aaggaacgcc cgtcgtggcc 9960 

agttcattca gggcaccgga caggccggtc 10020 

gacagccgga acacggcggc atcagagcag 10080 

aatagcctct ccacccaagc ggccggagaa 10140 
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cctgcgtgca atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 

tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260 

agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 

acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 

tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 

gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 

gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 

cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 

gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 

ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740 

actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 

gcgtgaagct tgcatgcctg caggtcga.cg gcgcgccgag ctcctcgagc aaatttacac 10860 

attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 

tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 

tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 

aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 

attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160 

ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220 

tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 112 80 

taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 

tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 

aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460 

ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 

agccagccca ccgcggtggg cggccgcctg cagtctagaa ggcctcctgc tttaatgaga 11580 

tatgcgagac gcctatgatc gcatgatatt tgctttcaat tctgttgtgc acgttgtaaa 11640 

aaacctgagc atgtgtagct cagatcctta ccgccggttt cggttcattc taatgaatat 11700 

atcacccgtt actatcgtat ttttatgaat aatattctcc gttcaattta ctgattgtcc 11760 

gtcgacgaat tcgagctcgg cgcgcctcta gaggatcgat gaattcagat cggctgagtg 11820 

gctccttcaa cgttgcggtt ctgtcagttc caaacgtaaa acggcttgtc ccgcgtcatc 11880 



68 



ggcgggggtc ataacgtgac tcccttaatt ctccgctcat gatcagattg . tcgtttcccg 11940 

ccttcagttt aaactatcag tgtttgacag gatatattgg cgggtaaacc taagagaaaa 12000 

gagcgtttat tagaataatc ggatatttaa aagggcgtga aaaggtttat ccttcgtcca 12060 

tttgtatgtg catgccaacc acagggttcc cca 12093 



<210> 28 

<211> 12085 

<212> DNA 

<213> artificial sequence 
<220> 

<221> misc_f eature 

<222> (1) . . (12085) 

<223> plant expression vector with one promoter- terminator expression cassette 



<400> 28 
gatctggcgc 


cggccagcga 


gacgagcaag 


attggccgcc 


gcccgaaacg 


atccgacagc 


60 


gcgcccagca 


caggtgcgca 


ggcaaattgc 


accaacgcat 


acagcgccag 


cagaatgeca 


120 


1~ self onococj 


taacat cert t 


cgagtgaacc 


agatcgegea 


crcracrcrc ccqq 


cagcaccggc 


180 


ataatcaggc 


cgatgccgac 


agegtcgage 


gcgacagtgc 


tcagaattac 


gatcaggggt 


240 


atgttgggtt 


teaegtctgg 


cctccggacc 


agcctccgct 


ggtccgattg 


aacgegegga 


300 


ttctttatca 


ctgataagtt 


ggtggacata 


ttatgtttat 


cagtgataaa 


gtgtcaagca 


360 


tgacaaagtt 


geagecgaat 


acagtgatcc 


gtgccgccct 


ggacctgttg 


aacgaggtcg 


420 


gcgtagacgg 


tctgacgaca 


cgcaaactgg 


eggaaeggtt 


gggggttcag 


cagccggcgc 


480 


tttactggca 


cttcaggaac 


aagegggege 


tgctcgacgc 


actggccgaa 


gecatgetgg 


540 


cggagaatca 


tacgeatteg 


gtgecgagag 


ccgacgacga 


ctggcgctca 


tttctgatcg 


600 


ggaatgcccg 


cagcttcagg 


caggcgctgc 


tcgcctaccg 


cgatggcgcg 


cgcatccatg 


660 


ccggcacgcg 


accgggcgca 


ccgcagatgg 


aaaeggcega 


cgcgcagctt 


cgcttcctct 


720 


gcgaggcggg 


ttttteggee 


ggggacgccg 


teaatgeget 


gatgacaatc 


agctacttca 


780 


ctgttggggc 


cgtgcttgag 


gagcaggecg 


gegacagega 


tgeeggegag 


cgcggcggca 


840 


ccgttgaaca 


ggctccgctc 


tcgccgctgt 


tgcgggccgc 


gatagacgee 


ttcgacgaag 


900 


ccggtccgga 


cgcagcgttc 


gagcagggac 


tegeggtgat 


tgtcgatgga 


ttggcgaaaa 


960 


ggaggctcgt 


tgtcaggaac 


gttgaaggac 


cgagaaaggg 


tgacgattga 


tcaggaccgc 


1020 
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tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 

ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 

ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 

gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 12 60 

ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 

aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 13 80 

ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 

aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 

gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 

ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620 

tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680 

ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 

ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 

ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 

agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 192 0 

aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 

aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2 040 

tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 

tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 

gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 

aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 

gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 

gacgctcacc gggctggttg ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 

cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460 

cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 

cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 

gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2640 

gatgtggaca agcctgggga taagtgccct gcggtattga cacttgaggg gcgcgactac 2700 

tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 
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gcacctattg 


acatttgagg 


ggctgtccac 


aggcagaaaa 


tccagcattt . 


gcaagggttt 


z oz U 


ccgcccgttt 


ttcggccacc 


gctaacctgt 


cttttaacct 


gcttttaaac 


caatatt cat 


a o q a 
z ooU 


aaaccttgtt 


tttaaccagg 


gctgcgccct 


gtgcgcgtga 


ccgcgcacgc 


cgaagggggg 


a a A A 

z940 


tgccccccct 


tctcgaaccc 


tcccggcccg 


ctaacgcggg 


cctcccatcc 


ccccaggggc 


"j a a a 

3000 


tgcgcccctc 


ggccgcgaac 


ggcctcaccc 


caaaaatggc 


agcgctggca 


gtccttgcca 


O A C A 

3060 


ttgccgggat 


cggggcagta 


acgggatggg 


cgatcagccc 


gagcgcgacg 


cccggaagca 


O 1 A A 

JlzO 


ttgacgtgcc 


gcaggtgctg 


gcatcgacat 


tcagcgacca 


ggtgccgggc 


agtgagggcg 


3180 


gcggcctggg 


■tggcggcctg 


cccttcactt 


cggccgtcgg 


ggcattcacg 


gacttcatgg 


"3 A A A 

3z40 


cggggccggc 


aatttttacc 


ttgggcattc 


ttggcatagt 


ggtcgcgggt 


gccgtgctcg 


O *5 A A 

3300 


tgttcggggg. 


tgcgataaac 


ccagcgaacc 


atttgaggtg 


ataggtaaga 


ttataccgag 


3360 


gtatgaaaac 


gagaattgga 


cctttacaga 


attactctat gaagcgccat 


atttaaaaag 


3420 


ctaccaagac 


gaagaggatg 


aagaggatga 


ggaggcagat 


tgccttgaat 


atattgacaa 


3480 


tactgataag 


ataatatatc 


ttttatatag 


aagatatcgc 


cgtatgtaag 


gatttcaggg 


3540 


ggcaaggcat 


aggcagcgcg 


cttatcaata 


tatctataga 


atgggcaaag 


cataaaaact 


3600 


tgcatggact 


aatgcttgaa 


acccaggaca 


ataaccttat 


agcttgtaaa 


ttctatcata 


3660 


attgggtaat 


gactccaact 


tattgatagt 


gttttatgtt 


cagataatgc 


ccgatgactt 


O 1 A A 

37z0 


tgtcatgcag 


ctccaccgat 


tttgagaacg 


acagcgactt 


ccgtcccagc 


cgtgccaggt 


T t o n 

3780 


gctgcctcag 


attcaggtta 


tgccgctcaa 


ttcgctgcgt 


atatcgcttg 


ctgattacgt 


O O A A 

3840 


gcagctttcc 


cttcaggcgg 


gattcataca 


gcggccagcc 


atccgtcatc 


catatcacca 


"5 A A A 

3900 


cgtcaaaggg 


tgacagcagg 


ctcataagac 


gccccagcgt 


cgccatagtg 


cgttcaccga 


1 A ZT A 

3960 


atacgtgcgc 


aacaaccgtc 


ttccggagac 


tgtcatacgc 


gtaaaacagc 


cagcgctggc 


A A A A 

40z 0 


gcgatttagc 


cccgacatag 


ccccactgtt 


cgtccatttc 


cgcgcagacg 


atgacgtcac 


/l A O A 

4080 


tgcccggctg 


tatgcgcgag 


gttaccgact 


gcggcctgag 


ttttttaagt 


gacgtaaaat 


A 1 /I A 

4140 


cgtgttgagg 


ccaacgccca 


taatgcgggc 


tgttgcccgg 


catccaacgc 


cattcatggc 


/I A A A 

4200 


catatcaatg 


attttctggt 


gcgtaccggg 


ttgagaagcg 


gtgtaagtga 


actgcagttg 


4260 


ccatgtttta 


cggcagtgag 


agcagagata 


gcgctgatgt 


ccggcggtgc 


ttttgccgtt 


4320 


acgcaccacc 


ccgtcagtag 


ctgaacagga 


gggacagctg 


atagacacag 


aagccactgg 


4380 


agcacctcaa 


aaacaccatc 


atacactaaa 


tcagtaagtt 


ggcagcatca 


cccataattg 


4440 
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tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 

aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 

cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620 

taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 

gtaaaagata cggaaggaat -gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 

aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 

gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 

gaacggcatg atggctggag caatctgctc atgagtgagg ccgatggcgt cctttgctcg 4920 

gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 

aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 

ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 

gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 
cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa . 5220 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 

gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 
ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta - 5400 

ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 

gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 

cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 

ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 

cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 

cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 

gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 

gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 

ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 

aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 

cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 

ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 



ccgctctgcc 


ctgttcacca 


cgcgcaacaa 


gaaaatcccg 


cgcgaggcgc . 


tgcaaaacaa 


OZ4U 


ggtcattttc 


cacgtcaaca 


aggacgtgaa 


gatcacctac 


accggcgtcg 


agetgeggge 


£ *5 a a 

6300 


cgacgatgac 


gaactggtgt 


ggcagcaggt 


gttggagtac 


gegaagegea 


cccctatcgg 


636U 


cgagccgatc 


accttcacgt 


tctacgagct 


ttgecaggae 


ctgggctggt 


cgatcaatgg 


C A A A 

6420 


ccggtattac 


aegaaggecg 


aggaatgect 


gtcgcgccta 


caggegaegg 


cgatgggctt 


C A O A 

64 80 


cacgtccgac 


cgcgttgggc 


acctggaatc 


ggtgtcgctg 


ctgcaccgct 


tccgcgtcct 


ez n a A 

6540 


ggaccgtggc 


aagaaaacgt 


cccgttgcca 


ggtcctgatc 


gacgaggaaa 


tegtegtget 


C C A A 

6600 


gtttgctggc 


gaccactaca 


cgaaattcat 


atgggagaag 


taccgcaagc 


tgtcgccgac 


C C (2 A 

6660 


ggcccgacgg 


atgttcgact 


atttcagctc 


geacegggag 


ccgtacccgc 


tcaagctgga 


6720 


aaccttccgc 


ctcatgtgcg 


gateggatte 


cacccgcgtg 


aagaagtggc 


gcgagcaggt 


6780 


cggcgaagcc 


tgcgaagagt 


tgcgaggcag 


cggcctggtg gaacacgcct 


gggtcaatga 


6840 


tgacctggtg 


cattgeaaac 


getagggect 




gttccggctg 


ggggttcagc 


6900 


age cage get 


ttactggcat 


ttcaggaaca 


acrccrcrcrcact 


gctcgacgca 


cttgcttcgc 


6960 


teagtatege 


tegggacgea 


cggcgcgctc 


tacgaactgc 


cgataaacag 


aggattaaaa 


7020 


ttgacaattg 


tgattaaggc 


tcagattcga 


eggcttggag 


cggccgacgt 


gcaggatttc 


rj AAA 

7080 


cgcgagatcc 


gattgtegge 


cctgaagaaa 


gctccagaga 


tgttcgggtc 


cgtttacgag 


*1 1 A A 

7140 


cacgaggaga 


aaaageccat 


ggaggcgttc 


getgaaeggt 


tgcgagatgc 


cgtggcattc 


7200 


ggcgcctaca 


tegaeggega 


gatcattggg 


ctgtcggtct 


tcaaacagga 


ggacggcccc 


7260 


aaggacgetc 


acaaggegea 


tctgtccggc 


gttttcgtgg 


agcccgaaca 


gegaggcega 


T-) A A 

7320 


ggggtcgccg 


gtatgetget 


gcgggcgttg 


ccggcgggtt 


tattgetegt 


gatgategtc 


1 "3 O A 

73 80 


cgacagattc 


caaegggaat 


ctggtggatg 


cgcatcttca 


tcctcggcgc 


acttaatatt 


1 A A A 

/440 


tegctattet 


ggagcttgtt 


gtttatttcg 


gtctaccgcc 


tgeegggegg 


ggtcgcggcg 


*7 C A A 

/ bUO 


aeggtaggeg 


ctgtgcagcc 


gctgatggtc 


gtgttcatct 


ctgccgctct 


gctaggtagc 


H C C A 

/bou 


ccgatacgat 


tgatggcggt 


cctgggggct 


atttgeggaa 


ctgcgggcgt 


ggcgctgttg 


"i e a a 

7620 


gtgttgacac 


caaacgcagc 


gctagatcct 


gteggegteg 


cagcgggcct 


ggcgggggcg 


7680 


gtttccatgg 


cgttcggaac 


cgtgctgacc 


cgcaagtggc 


aacctcccgt 


gcctctgctc 


7740 


acctttaccg 


cctggcaact 


ggcggccgga 


ggacttctgc 


tcgttccagt 


agctttagtg 


7800 


tttgatcege 


caatcccgat 


gectacagga 


accaatgttc 


tcggcctggc 


gtggctcggc 


7860 
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ctgatcggag cgggtttaac ctacttcctt tggttccggg ggatctcgcg actcgaacct 7920 

acagttgttt ccttactggg ctttctcagc cccagatctg gggtcgatca gccggggatg 7980 

catcaggccg acagtcggaa cttcgggtcc ccgacctgta ccattcggtg agcaatggat 8040 

aggggagttg atatcgtcaa cgttcacttc taaagaaata gcgccactca gcttcctcag 8100 

cggctttatc cagcgatttc ctattatgtc ggcatagttc tcaagatcga cagcctgtca 8160 

cggttaagcg agaaatgaat aagaaggctg ataattcgga tctctgcgag ggagatgata 8220 

tttgatcaca ggcagcaacg ctctgtcatc gttacaatca acatgctacc ctccgcgaga 82 80 

tcatccgtgt ttcaaacccg gcagcttagt tgccgttctt ccgaatagca tcggtaacat 8340 

gagcaaagtc tgccgcctta caacggctct cccgctgacg ccgtcccgga ctgatgggct 8400 

gcctgtatcg agtggtgatt ttgtgccgag ctgccggtcg gggagctgtt ggctggctgg 8460 

tggcaggata tattgtggtg taaacaaatt gacgcttaga caacttaata acacattgcg 8520 

gacgttttta atgtactggg gtggtttttc ttttcaccag tgagacgggc aacagctgat 8580 

tgcccttcac cgcctggccc tgagagagtt gcagcaagcg gtccacgctg gtttgcccca 8640 

gcaggcgaaa atcctgtttg atggtggttc cgaaatcggc aaaatccctt ataaatcaaa 8700 

agaatagccc gagatagggt tgagtgttgt tccagtttgg aacaagagtc cactattaaa 8760 

gaacgtggac tccaacgtca aagggcgaaa aaccgtctat cagggcgatg gcccactacg 8820 

tgaaccatca cccaaatcaa gttttttggg gtcgaggtgc cgtaaagcac taaatcggaa 8880 

ccctaaaggg agcccccgat ttagagcttg acggggaaag ccggcgaacg tggcgagaaa 8940 

ggaagggaag aaagcgaaag gagcgggcgc cattcaggct gcgcaactgt tgggaagggc 9000 

gatcggtgcg ggcctcttcg ctattacgcc agctggcgaa agggggatgt gctgcaaggc 9060 

gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg acggccagtg 9120 

aattaattcc catcttgaaa gaaatatagt ttaaatattt attgataaaa taacaagtca 9180 

ggtattatag tccaagcaaa aacataaatt tattgatgca agtttaaatt cagaaatatt 9240 

tcaataactg attatatcag ctggtacatt gccgtagatg aaagactgag tgcgatatta 9300 

tgtgtaatac ataaattgat gatatagcta gcttagctca tcgggggatc cgtcgaagct 9360 

agcttgggtc ccgctcagaa gaactcgtca agaaggcgat agaaggcgat gcgctgcgaa 9420 

tcgggagcgg cgataccgta aagcacgagg aagcggtcag cccattcgcc gccaagctct 9480 

tcagcaatat cacgggtagc caacgctatg; tcctgatagc ggtccgccac acccagccgg 9540 

ccacagtcga tgaatccaga aaagcggcca ttttccacca tgatattcgg caagcaggca 9600 
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tcgccatggg tcacgacgag atcctcgccg tcgggcatgc gcgccttgag. cctggcgaac 9660 

agttcggctg gcgcgagccc ctgatgctct tcgtccagat catcctgatc gacaagaccg 972 0 

gcttccatcc gagtacgtgc tcgctcgatg cgatgtttcg cttggtggtc gaatgggcag 97 80 

gtagccggat caagcgtatg cagccgccgc attgcatcag ccatgatgga tactttctcg 9840 

gcaggagcaa ggtgagatga caggagatcc tgccccggca cttcgcccaa tagcagccag 9900 

tcccttcccg cttcagtgac aacgtcgagc acagctgcgc aaggaacgcc cgtcgtggcc 9960 

agccacgata gccgcgctgc ctcgtcctgc agttcattca gggcaccgga caggtcggtc 10020 

ttgacaaaaa gaaccgggcg cccctgcgct gacagccgga acacggcggc atcagagcag 10080 

ccgattgtct gttgtgccca gtcatagccg aatagcctct ccacccaagc ggccggagaa 10140 

cctgcgtgca. atccatcttg ttcaatccaa gctcccatgg gccctcgact agagtcgaga 10200 

tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260 

agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 

acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 

tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 

gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 

gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 

cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 

gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 

ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740 

actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 

gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 

attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 1092 0 

tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 

tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 

aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 

attaaagtga gtgaatatgg taccacaagg tLtggagatt taattgttgc aatgctgcat 11160 

ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220 

tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 112 80 
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taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 

tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 

aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460 

ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 

agccagcgga tccgatatcg ggcccgctag cgttaaccct gctttaatga gatatgcgag 11580 

acgcctatga tcgcatgata tttgctttca attctgttgt gcacgttgta .aaaaacctga 11640 

gcatgtgtag ctcagatcct taccgccggt ttcggttcat tctaatgaat atatcacccg 11700 

ttactatcgt atttttatga ataatattct ccgttcaatt tactgattgt ccgtcgacga 11760 

attcgagctc ggcgcgcctc tagaggatcg atgaattcag atcggctgag tggctccttc 11820 

aacgttgcgg ttctgtcagt tccaaacgta aaacggcttg tcccgcgtca tcggcggggg 11880 

tcataacgtg actcccttaa ttctccgctc atgatcagat tgtcgtttcc cgccttcagt 11940 

ttaaactatc agtgtttgac aggatatatt ggcgggtaaa cctaagagaa aagagcgttt 12000 
attagaataa tcggatattt aaaagggcgt gaaaaggttt atccttcgtc catttgtatg . 12060 

tgcatgccaa ccacagggtt cccca 12085 

<210> 29 
<211> 12079 
<212> DNA 

<213> artificial sequence 
<220> 

<221> misc_f eature 
<222> (1).. (12079) 

<223> plant expression vector with one promoter- terminator expression cassette 
<400> 29 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 

gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 12 0 

tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 

ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240 

atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 

ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 

tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 

gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 

tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 
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cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca. tttctgatcg 600 

ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 

ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 

gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 

ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 

ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 900 

ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 

ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 102 0 

tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 

ccaccgcgtc. agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 

ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 

gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260 

ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 

aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 13 80 

ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 

aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc 1500 

gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 

ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620 

tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680 

ggcgtcagcc gggcaggata ggtgaagtag gcccacccgc gagcgggtgt tccttcttca 1740 

ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 

ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 

agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 

aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 

aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 

tgggccgccL gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 

tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 

gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 
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aacggccggg 


gggtgcgcgt 


gattgccaag 


cacgtcccca 


tgcgctccat 


caagaagagc 


2280 


gacttcgcgg 


agctggtgaa 


gtacatcacc 


gacgagcaag 


gcaagaccga 


gcgcctttgc 


2340 


gacgctcacc 


gggctggttg 


ccctcgccgc 


tgggctggcg 


gccgtctatg 


gccctgcaaa 


2400 


cgcgccagaa 


acgccgtcga 


agccgtgtgc 


gagacaccgc 


ggccgccggc 


gttgtggata 


2460 


cctcgcggaa 


aacttggccc • 


tcactgacag 


atgaggggcg 


gacgttgaca 


cttgaggggc 


2520 


cgactcaccc 


ggcgcggcgt 


tgacagatga 


ggggcaggct 


cgatttcggc 


cggcgacgtg 


2580 


gagctggcca 


gcctcgcaaa 


tcggcgaaaa 


cgcctgattt 


tacgcgagtt 


tcccacagat 


2640 


gatgtggaca 


agcctgggga 


taagtgccct 


gcggtattga 


cacttgaggg 


gcgcgactac 


2700 


tgacagatga 


ggggcgcgat 


ccttgacact 


tgaggggcag 


agtgctgaca 


gatgaggggc 


2760 


gcacctattg 


acatttgagg 


ggctgtccac 


aggcagaaaa 


tccagcattt 


gcaagggttt 


2820 


ccgcccgttt 


ttcggccacc 


gctaacctgt 


cttttaacct 


gcttttaaac 


caatatttat 


2880 


aaaccttgtt 


tttaaccagg 


gctgcgccct 


gtgcgcgtga 


ccgcgcacgc 


cgaagggggg 


2940 


tgccccccct 


tctcgaaccc 


tcccggcccg 


ctaacgcggg 


cctcccatcc 


ccccaggggc 


3000 


tgcgcccctc 


ggccgcgaac 


ggcctcaccc 


caaaaatggc 


agcgctggca 


gtccttgcca 


3060 


ttgccgggat 


cggggcagta 


acgggatggg 


cgatcagccc 


gagcgcgacg 


cccggaagca 


3120 


ttgacgtgcc 


gcaggtgctg 


gcatcgacat 


tcagcgacca 


ggtgccgggc 


agtgagggcg 


3180 


gcggcctggg 


tggcggcctg 


cccttcactt 


cggccgtcgg 


ggcattcacg 


gacttcatgg 


3240 


cggggccggc 


aatttttacc 


ttgggcattc 


ttggcatagt 


ggtcgcgggt 


gccgtgctcg 


3300 


tgttcggggg 


tgcgataaac 


ccagcgaacc 


atttgaggtg 


ataggtaaga 


ttataccgag 


3360 


gtatgaaaac 


gagaattgga 


cctttacaga 


attactctat 


gaagcgccat 


atttaaaaag 


3420 


ctaccaagac 


gaagaggatg 


aagaggacga 


ggaggcagau 


LyCCU LyaaL 


af aj- f naoaa 


J^OU 


tac tgataag 


acaacacacc 




a a rt afaf /~f 

aayaUaLCgC 




n ^3 1* f- 1* c*z* rrrrrr 
yaLL uv-.ciy y y 




ggcaaggcat 


aggcagcgcg 


cttatcaata 


tatctataga 


atgggcaaag 


cataaaaact 


3600 


tgcatggact 


aatgcttgaa 


acccaggaca 


ataaccttat 


agcttgtaaa 


ttctatcata 


3660 


attgggtaat 


gactccaact 


tattgatagt 


gttttatgtt 


cagataatgc 


ccgatgactt 


3720 


tgtcatgcag 


ctccaccgat 


tttgagaacg 


acagcgactt 


ccgtcccagc 


cgtgccaggt 


3780 


gctgcctcag 


attcaggtta 


tgccgctcaa 


ttcgctgcgt 


atatcgcttg 


ctgattacgt 


3840 


gcagctttcc 


cttcaggcgg 


gattcataca 


gcggccagcc 


atccgtcatc 


catatcacca 


3900 
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cgtcaaaggg 


tgacagcagg 


ctcataagac 


gccccagcgt 


cgccatagtg 


cgttcaccga 


3960 


atacgtgcgc 


aacaaccgtc 


ttccggagac 


tgtcatacgc 


gtaaaacagc 


cagcgctggc 


4020 


gcgatttagc 


cccgacatag 


ccccactgtt 


cgtccatttc 


cgcgcagacg 


atgacgtcac 


4080 


tgcccggctg 


tatgcgcgag 


gttaccgact 


gcggcctgag 


ttttttaagt 


gacgtaaaat 


4140 


cgtgttgagg 


ccaacgccca 


taatgcgggc 


tgttgcccgg 


catccaacgc 


cattcatggc 


4200 


catatcaatg 


attttctggt 


gcgtaccggg 


ttgagaagcg 


gtgtaagtga 


actgcagttg 


4260 


ccatgtttta 


cggcagtgag 


agcagagata 


gcgctgatgt 


ccggcggtgc 


ttttgccgtt 


4320 


acgcaccacc 


ccgtcagtag 


ctgaacagga 


gggacagctg 


atagacacag 


aagccactgg 


4380 


agcacctcaa 


aaacaccatc 


atacactaaa 


tcagtaagtt 


ggcagcatca 


cccataattg 


4440 


tggtttcaaa 


atcggctccg 


tcgatactat 


gttatacgcc 


aactttgaaa 


acaactttga 


4500 


aaaagctgtt 


ttctggtatt 


taaggtttta 


gaatgcaagg 


aacagtgaat 


tggagttcgt 


4560 


cttgttataa 


ttagcttctt 


ggggtatctt 


taaatactgt 


agaaaagagg 


aaggaaataa 


4620 


taaatggcta 


aaatgagaat 


atcaccggaa 


ttgaaaaaac 


tgatcgaaaa 


ataccgctgc 


4680 


gtaaaagata 


cggaaggaat 


gtctcctgct 


aaggtatata' 


agctggtggg 


agaaaatgaa 


4740 


aacctatatt 


taaaaatgac 


ggacagccgg 


tataaaggga 


ccacctatga 


tgtggaacgg 


4800 


gaaaaggaca 


tgatgctatg 


gctggaagga 


aagctgcctg 


ttccaaaggt 


cctgcacttt 


4860 


gaacggcatg 


atggctggag 


caatctgctc 


atgagtgagg 


ccgatggcgt 


cctttgctcg 


4920 


gaagagtatg 


aagatgaaca 


aagccctgaa 


aagattatcg 


agctgtatgc 


ggagtgcatc 


4980 


aggctctttc 


actccatcga 


catatcggat 


tgtccctata 


cgaatagctt 


agacagccgc 


5040 


ttagccgaat 


tggattactt 


actgaataac 


gatctggccg 


atgtggattg 


cgaaaactgg 


5100 


gaagaagaca 


ctccatttaa 


agatccgcgc 


gagctgtatg 


attttttaaa 


gacggaaaag 


5160 


cccgaagagg 


aacttgtctt 


ttcccacggc 


gacctgggag 


acagcaacat 


ctttgtgaaa 


5220 


gatggcaaag 


taagtggctt 


tattgatctt 


gggagaagcg 


gcagggcgga 


caagtggtat 


5280 


gacattgcct 


tctgcgtccg 


gtcgatcagg 


gaggatatcg 


gggaagaaca 


gtatgtcgag 


5340 


ctattttttg 


acttactggg 


gatcaagcct 


gattgggaga 


aaataaaata 


ttatatttta 


5400 


ctggatgaat 


tgttttagta 


cctagatgtg 


gcgcaacgat 


gccggcgaca 


agcaggagcg 


5460 


caccgacttc 


ttccgcatca 


agtgttttgg 


ctctcaggcc 


gaggcccacg 


gcaagtattt 


5520 


gggcaagggg 


tcgctggtat 


tcgtgcaggg 


caagattcgg aataccaagt 


acgagaagga 


5580 


cggccagacg 


gtctacggga 


ccgacttcat 


tgccgataag 


gtggattatc 


tggacaccaa 


5640 
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ggcaccaggc gggtcaaate aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 

cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 

cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 582 0 

gccccgcgaa accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 

gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 

ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 

aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 

cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 

ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 

ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 

ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 6300 

cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360 

cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 

ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480 

cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 

ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 

gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 

ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 

aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 

cggcgaagcc tgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840 

tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 

agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 

tcagtatcgc tcgggacgca cggcgcgctc tacgaactgc cgataaacag aggattaaaa 7020 

ttgacaattg tgattaaggc tcagattcga cggcttggag cggccgacgt gcaggatttc 7080 

cgcgagatcc gattgtcggc cctgaagaaa gctccagaga tgttcgggtc cgtttacgag 7140 

cacgaggaga aaaagcccat ggaggcgtcc gctgaacggt tgcgagatgc cgcggcattc 7200 

ggcgcctaca tcgacggcga gatcattggg ctgtcggtct tcaaacagga ggacggcccc 7260 

aaggacgctc acaaggcgca tctgtccggc gttttcgtgg agcccgaaca gcgaggccga 7320 
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ggggtcgccg 


gtatgctgct 


gcgggcgttg 


ccggcgggtt 


tattgc tcgt 


gac.gat.cguc 


/joU 


cgacagattc 


caacgggaat 


ctggtggatg 


cgcatcttca 


tcctcggcgc 


acttaatatt 


7440 


tcgctattct 


ggagcttgtt 


gtttatttcg 


gtctaccgcc 


tgccgggcgg 


ggtcgcggcg 


7500 


acggtaggcg 


ctgtgcagcc 


gctgatggtc 


gtgttcatct 


ctgccgctct 


gctaggtagc 


7560 


ccgatacgat 


tgatggcggt 


cctgggggct 


atttgcggaa 


ctgcgggcgt 


ggcgctgttg 


7620 


gtgttgacac 


caaacgcagc 


gctagatcct 


gtcggcgtcg 


cagcgggcct 


ggcgggggcg 


7680 


gtttccatgg 


cgttcggaac 


cgtgctgacc 


cgcaagtggc 


aacctcccgt 


gcctctgctc 


7740 


acctttaccg 


cctggcaact 


ggcggccgga 


ggacttctgc 


tcgttccagt 


agctttagtg 


7800 


tttgatccgc 


caatcccgat 


gcctacagga 


accaatgttc 


tcggcctggc 


gtggctcggc 


7860 


ctgatcggag. 


cgggtttaac 


ctacttcctt 


tggttccggg 


ggatctcgcg 


actcgaacct 


7920 


acagttgttt 


ccttactggg 


ctttctcagc 


cccagatctg 


gggtcgatca 


gccggggatg 


7980 


catcaggccg 


acagtcggaa 


cttcgggtcc 


ccgacctgta 


ccattcggtg 


agcaatggat 


8040 


aggggagttg 


atatcgtcaa 


cgttcacttc 


taaagaaata 


gcgccactca 


gcttcctcag 


8100 


cggctttatc 


cagcgatttc 


ctattatgtc 


ggcatagttc 


tcaagatcga 


cagcctgtca 


8160 


cggttaagcg 


agaaatgaat 


aagaaggctg 


ataattcgga 


tctctgcgag 


ggagatgata 


8220 


tttgatcaca 


ggcagcaacg 


ctctgtcatc 


gttacaatca 


acatgctacc 


ctccgcgaga 


8280 


tcatccgtgt 


ttcaaacccg 


gcagcttagt 


tgccgttctt 


ccgaatagca 


tcggtaacat 


8340 


gagcaaagtc 


tgccgcctta 


caacggctct 


cccgctgacg 


ccgtcccgga 


ctgatgggct 


8400 


gcctgtatcg 


agtggtgatt 


ttgtgccgag 


ctgccggtcg 


gggagctgtt 


ggctggctgg 


8460 


tggcaggata 


tattgtggtg 


taaacaaatt 


gacgcttaga 


caacttaata 


acacattgcg 


8520 


gacgttttta 


atgtactggg 


gtggtttttc 


ttttcaccag 


tgagacgggc 


aacagctgat 


8580 


tgcccttcac 


cgcctggccc 


tgagagagtt 


gcagcaagcg 


gtccacgctg 


gtttgcccca 


8640 


gcaggcgaaa 


atcctgtttg 


atggtggttc 


cgaaatcggc 


aaaatccctt 


ataaatcaaa 


8700 


agaatagccc 


gagatagggt 


tgagtgttgt 


tccagtttgg 


aacaagagtc 


cactattaaa 


8760 


gaacgtggac 


tccaacgtca 


aagggcgaaa 


aaccgtctat 


cagggcgatg 


gcccactacg 


8820 


tgaaccatca 


cccaaatcaa 


gttttttggg 


gtcgaggtgc 


cgtaaagcac 


taaatcggaa 


8880 


ccctaaaggg 


agcccccgat 


ttagagcttg 


acggggaaag 


ccggcgaacg 


tggcgagaaa 


8940 


ggaagggaag 


aaagcgaaag 


gagcgggcgc 


cattcaggct 


gcgcaactgt 


tgggaagggc 


9000 
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gatcggtgcg 
gattaagttg 
aattaattcc 
ggtattatag 
tcaataactg 
tgtgtaatac 
agcttgggtc 
tcgggagcgg 
tcagcaatat 
ccacagtcga 
tcgccatggg 
agttcggctg 
gcttccatcc 
gtagccggat 
gcaggagcaa 
tcccttcccg 
agccacgata 
ttgacaaaaa 
ccgattgtct 
cctgcgtgca 
tctggattga 
agtggagcat 
acgcgcaata 
tgagtggctc 
gtcatcggcg 
gcgccatcag 
cttaccagag 
gtctagctat 
ttcccttgtc 



ggcctcttcg 
ggtaacgcca 
catcttgaaa 
tccaagcaaa 
attatatcag 
ataaattgat 
ccgctcagaa 
cgataccgta 
cacgggtagc 
tgaatccaga 
tcacgacgag 
gcgcgagccc 
gagtacgtgc 
caagcgtatg 
ggtgagatga 
cttcagtgac 
gccgcgctgc 
gaaccgggcg 
gttgtgccca 
atccatcttg 
gagtgaatat 
ttttgacaag 
atggtttctg 
cttcaacgtt 

ggggtcataa 

atccttggcg 
ggcgccccag 
cgccatgtaa 
cagatagccc 



ctattacgcc 
gggttttccc 
gaaatatagt 
aacataaatt 
ctggtacatt 
gatatagcta 
gaactcgtca 
aagcacgagg 
caacgctatg 
aaagcggcca 
atcctcgccg 
ctgatgctct 
tcgctcgatg 
cagccgccgc 
caggagatcc 
aacgtcgagc 
ctcgtcctgc 
cccctgcgct 
gtcatagccg 
ttcaatccaa 
gagactctaa 
aaatatttgc 
acgtatgtgc 
gcggttctgt 
cgtgactccc 
gcaagaaagc 
ctggcaattc 
gcccactgca 
agtagctgac 



agctggcgaa 
agtcacgacg 
ttaaatattt 
tattgatgca 
gccgtagatg 
gcttagctca 
agaaggcgat 
aagcggtcag 
tcctgatagc 
ttttccacca 
tcgggcatgc 
tcgtccagat 
cgatgtttcg 
attgcatcag 
tgccccggca 
acagctgcgc 
agttcattca 
gacagccgga 
aatagcctct 
gctcccatgg 
ttggataccg 
tagctgatag 
ttagctcatt 
cagttccaaa 
ttaattctcc 
catccagttt 
cggttcgctt 
agctacctgc 
attcatccgg 



agggggatgt 
ttgtaaaacg 
attgataaaa 
agtttaaatt 
aaagactgag 
tcgggggatc 
agaaggcgat 
cccattcgcc 
ggtccgccac 
tgatattcgg 
gcgccttgag 
catcctgatc 
cttggtggtc 
ccatgatgga 
cttcgcccaa 
aaggaacgcc 
gggcaccgga 
acacggcggc 
ccacccaagc 
gccctcgact 
aggggaattt 
tgaccttagg 
aaactccaga 
cgtaaaacgg 
gctcatgatc 
actttgcagg 
gctgtccata 
tttctctttg 
ggtcagcacc 



gctgcaaggc 
acggccagtg 
taacaagtca 
cagaaatatt 
tgcgatatta 
cgtcgaagct 
gcgctgcgaa 
gccaagctct 
acccagccgg 
caagcaggca 
cctggcgaac 
gacaagaccg 
gaatgggcag 
tactttctcg 
tagcagccag 
cgtcgtggcc 
caggtcggtc 
atcagagcag 
ggccggagaa 
agagtcgaga 
atggaacgtc 
cgacttttga 
aacccgcggc 
cttgtcccgc 
ttgatcccct 
gcttcccaac 
aaaccgccca 
cgcttgcgtt 
gtttctgcgg 



9060 
9120 
9180 
9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680 
10740 
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actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt. gcttgcggca 10800 

gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 

attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 

tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 

tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 

aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 

attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160 

ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220 

tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 

taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 

tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 

aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460 

ttttgcaata ctttcattca tacacactca ctaagtttta 1 cacgattata atttcttcat 11520 

agccagcaga tctgccggca tcgatcccgg gccatggcct gctttaatga gatatgcgag 11580 

acgcctatga tcgcatgata tttgctttca attctgttgt gcacgttgta aaaaacctga 11640 

gcatgtgtag ctcagatcct taccgccggt ttcggttcat tctaatgaat atatcacccg 11700 

ttactatcgt atttttatga ataatattct ccgttcaatt tactgattgt ccgtcgacga 117 60 

gctcggcgcg cctctagagg atcgatgaat tcagatcggc tgagtggctc cttcaacgtt 11820 

gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc gtcatcggcg ggggtcataa 11880 

cgtgactccc ttaattctcc gctcatgatc agattgtcgt ttcccgcctt cagtttaaac 11940 

tatcagtgtt tgacaggata tattggcggg taaacctaag agaaaagagc gtttattaga 12000 

ataatcggat atttaaaagg gcgtgaaaag gtttatcctt cgtccatttg tatgtgcatg 12060 

ccaaccacag ggttcccca 12079 
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<223> plant expression vector with two promoter- terminator expression cassettes 
<400> 30 



gatctggcgc 


cggccagcga 


gacgagcaag 


attggccgcc 


gcccgaaacg 


aCCCyaCagc 


fid 


gcgcccagca 


caggtgcgca 


ggcaaattgc 


accaacgcat 


acagcgccag 


CayaatyCCa 


J.ZU 


tagtgggcgg 


tgacgtcgtt 


cgagtgaacc 


agatcgcgca 


ggaggcccgg 


cagcaccggc 


1 on 


ataatcaggc 


cgatgccgac 


agcgtcgagc 


gcgacagtgc 


tcagaattac 


gaccaggggc 


9 zi n 

Z ft u 


atgttgggtt 


tcacgtctgg 


cctccggacc 


agcctccgct 


ggtccgaccg 


aacgcgcgga 


on 


ttctttatca 


ctgataagtt 


ggtggacata 


t tatgc ttat. 


cagtgataaa 


gtgtcaagca 


7 & n 


tgacaaagtt 


gcagccgaat 


acagtgatcc 


gtgccgccct 


ggacctgttg 


aacgaggtcg 




gcgtagacgg 


tctgacgaca 


cgcaaactgg 


cggaacggtt 


gggggttcag 


cagccggcgc 


4t oU 


tttactggca 


cttcaggaac 


aagcgggcgc 


tgctcgacgc 


actggccgaa 


gccatgctgg 


c: / n 


cggagaatca 


tacgcattcg 


gtgccgagag 


ccgacgacga 


ctggcgctca 


tttctgatcg 


c n n 
bUU 


ggaatgcccg 


cagcttcagg 


caggcgctgc 


tcgcctaccg 


cgatggcgcg 


cgcatccatg 


£ c n 
bo U 


ccggcacgcg 


accgggcgca 


ccgcagatgg 


aaacggccga 


cgcgcagctt 


cgcttcctct 


Ton 
/ Z U 


gcgaggcggg 


tttttcggcc 


ggggacgccg 


tcaatgcgct 


gatgacaatc 


agctacttca 


Ton 
/ oU 


ctgttggggc 


cgtgcttgag 


gagcaggccg 


gcgacagcga 


tgccggcgag 


cgcggcggca 


q / n 
o4 U 


ccgttgaaca 


ggctccgctc 


tcgccgctgt 


tgcgggccgc 


gatagacgcc 


t tcgacgaag 


y u u 


ccggtccgga 


cgcagcgttc 


gagcagggac 


tcgcggtgat 


tgtcgatgga 


ttggcgaaaa 


y oU 


ggaggctcgt 


tgtcaggaac 


gttgaaggac 


cgagaaaggg 


tgacgattga 


tcaggaccgc 


1 no n 
±uz u 


tgccggagcg 


caacccactc 


actacagcag 


agccatgtag 


acaacatccc 


^F-a 4— f*\ +- ^ 4~ 

CCCCCCCLtL 


XUoU 


ccaccgcgtc 


agacgcccgt 


agcagcccgc 


tacgggcttt 


ttcatgccct 


gccctagcgt 


1 1 /in 


ccaagcctca 


cggccgcgct 


cggcctctct 


ggcggccttc 


tggcgctctt 


ccgc u ccclc 


1 0 nn 


gctcactgac 


tcgctgcgct 


cggtcgttcg 


gc tgcggcga 


gcggtatcag 


cccac ucaaa 


1 9 a n 
xz 0 u 


ggcggtaata 


cggttatcca 


cagaatcagg 


ggataacgca 


ggaaagaaca 


cgLgagcaaa 




aggccagcaa 


aaggccagga 


accgtaaaaa 


ggccgcgttg 


ctggcgtttt 


tccataggct 


1380 


ccgcccccct 


gacgagcatc 


acaaaaatcg 


acgctcaagt 


cagaggtggc 


gaaacccgac 


1440 


aggactataa 


agataccagg 


cgtttccccc 


tggaagctcc 


ctcgtgcgct 


ctcctgttcc 


1500 


gaccctgccg 


cttaccggat 


acctgtccgc 


ctttctccct 


tcgggaagcg 


tggcgctttt 


1560 


ccgctgcata 


accctgcttc 


ggggtcatta 


tagcgatttt 


ttcggtatat 


ccatcctttt 


1620 
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tcgcacgata 


tacaggattt 


tgccaaaggg 


ttcgtgtaga 


ctttccttgg 


tgtatccaac 


1680 


ggcgtcagcc 


gggcaggata 


ggtgaagtag 


gcccacccgc gagcgggtgt 


tccttcttca 


1740 


ctgtccctta 


ttcgcacctg 


gcggtgctca 


acgggaatcc 


tgctctgcga 


ggctggccgg 


1800 


ctaccgccgg 


cgtaacagat 


gagggcaagc 


ggatggctga 


tgaaaccaag 


ccaaccagga 


1860 


agggcagccc 


acctatcaag 


gtgtactgcc 


ttccagacga 


acgaagagcg 


attgaggaaa 


1920 


aggcggcggc 


ggceggcatg 


agcctgtcgg 


cctacctgct 


ggccgtcggc 


cagggctaca 


1980 


aaatcacggg 


cgtcgtggac 


tatgagcacg 


tccgcgagct 


ggcccgcatc 


aatggcgacc 


2040 


tgggccgcct 


gggcggcctg 


ctgaaactct 


ggctcaccga 


cgacccgcgc 


acggcgcggt 


2100 


tcggtgatgc 


cacgatcctc . 


gccctgctgg 


cgaagatcga 


agagaagcag 


gacgagcttg 


2160 


gcaaggtcat 


gatgggcgtg 


gtccgcccga 


gggcagagcc 


atgacttttt 


tagccgctaa 


2220 


aacggccggg 


gggtgcgcgt 


gattgccaag 


cacgtcccca 


tgcgctccat 


caagaagagc 


2280 


gacttcgcgg 


agctggtgaa 


gtacatcacc 


gacgagcaag 


gcaagaccga 


gcgcctttgc 


2340 


gacgctcacc 


gggctggttg 


ccctcgccgc 


tgggctggcg 


gccgtctatg 


gccctgcaaa 


2400 


cgcgccagaa 


acgccgtcga 


agccgtgtgc 


gagacaccgc ggccgccggc 


gttgtggata 


2460 


cctcgcggaa 


aacttggccc 


tcactgacag 


atgaggggcg 


gacgttgaca 


cttgaggggc 


2520 


cgactcaccc 


ggcgcggcgt 


tgacagatga 


ggggcaggct 


cgatttcggc 


cggcgacgtg 


2580 


gagctggcca 


gcctcgcaaa 


tcggcgaaaa 


cgcctgattt 


tacgcgagtt 


tcccacagat 


2640 


gatgtggaca 


agcctgggga 


taagtgccct 


gcggtattga 


cacttgaggg 


gcgcgactac 


2700 


tgacagatga 


ggggcgcgat 


ccttgacact 


tgaggggcag 


agtgctgaca 


gatgaggggc 


2760 


gcacctattg 


acatttgagg 


ggctgtccac 


aggcagaaaa 


tccagcattt 


gcaagggttt 


2820 


■ccgcccgttt- 


ttcggccacc 


gctaacctgt 


cttttaacct 


gcttttaaac 


caatatttat 


2880 


aaaccttgtt 


tttaaccagg 


gctgcgccct 


gtgcgcgtga 


ccgcgcacgc 


cgaagggggg 


2940 


tgccccccct 


tctcgaaccc 


tcccggcccg 


ctaacgcggg 


cctcccatcc 


ccccaggggc 


3000 


tgcgcccctc 


ggccgcgaac 


ggcctcaccc 


caaaaatggc 


agcgctggca 


gtccttgcca 


3060 


ttgccgggat 


cggggcagta 


acgggatggg 


cgatcagccc 


gagcgcgacg 


cccggaagca 


3120 


ttgacgtgcc 


gcaggtgctg 


gcatcgacat 


tcagcgacca 


ggtgccgggc 


agtgagggcg 


3180 


gcggcctggg 


tggcggcctg 


cccttcactt 


cggccgtcgg 


ggcattcacg 


gacttcatgg 


3240 


cggggccggc 


aatttttacc 


ttgggcattc 


ttggcatagt 


ggtcgcgggt 


gccgtgctcg 


3300 


tgttcggggg 


tgcgataaac 


ccagcgaacc 


atttgaggtg 


ataggtaaga 


ttataccgag 


3360 
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gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420 

ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 

tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 

ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 

tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3660 

attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 

tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 

gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3 840 

gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 

cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 

atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 402 0 

gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 

tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 

cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 

catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 

ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt 4320 

acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 

agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 

tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 

aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 

cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620 

taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 

gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 

aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 

gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 

gaacggeatg atggctggag caatctgctc atgagtgagg ccgatggcgc cctttgctcg 4920 

gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 

aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 
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ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 

gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 

cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 522 0 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 

gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca gtatgtcgag 5340 

ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 

ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca agcaggagcg 5460 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 

gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 

cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 

ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 57 00 

cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 57 60 

cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 

gccccgcgaa accttccagt ccgtcggctc gatggtccag- caagctacgg ccaagatcga 5880 

gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 

ttcgcgtcgt ctcgaacagg aggcggcagg tttggcgaag tcgatgacca tcgacacgcg 6000 

aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 

cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 6120 

ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 

ccgctctgcc ctgtt caeca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 

ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agetgeggge 63 00 

cgacgatgac gaactggtgt ggcagcaggt gttggagtac gegaagegea cccctatcgg 6360 

cgagccgatc accttcacgt tctacgagct ttgecaggae ctgggctggt cgatcaatgg 6420 

ceggtattae aegaaggecg aggaatgect gtcgcgccta caggegaegg cgatgggctt 6480 

cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 

ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tegtegtget 6600 

gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 

ggcccgacgg atgttcgact atttcagctc geacegggag ccgtacccgc tcaagctgga 6720 

aaccttccgc ctcatgtgcg gateggatte cacccgcgtg aagaagtggc gcgagcaggt 67 80 
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cggcgaagcc 


tgcgaagagt 


tgcgaggcag 


cggcctggtg 


gaacacgcc t 


gggtcaatga 


O OH U 


tgacctggtg 


cattgcaaac 


gctagggcct 


tgtggggtca 


gttccggctg 


fT ft ft ft T~ t~ ^ ft f* 

ggggttcagc 


o^uu 


agccagcgct 


ttactggcat 


ttcaggaaca 


agcgggcact 


gctcgacgca 


cttgcttcgc 


C Q C C\ 


tcagtatcgc 


tcgggacgca 


cggcgcgctc 


tacgaactgc 


cgataaacag 


aggat taaaa 


/uzu 


ttgacaattg 


tgattaaggc 


tcagattcga 


cggcttggag 


cggccgacgt 


gcagga c l. lc 


/UoU 


cgcgagatcc 


gattgtcggc 


cctgaagaaa 


gctccagaga 


tgttcgggtc 


cgu l. cacgag 


Hi y| n 
/ J_4 U 


cacgaggaga 


aaaagcccat 


ggaggcgttc 


gctgaacggt 


tgcgagatgc 


cgtggcattc 


/ z uu 


ggcgcctaca 


tcgacggcga 


gatcattggg 


ctgtcggtct 


tcaaacagga 


ggacggcccc 


/ Z O U 


aaggacgctc 


acaaggcgca 


tctgtccggc 


gttttcgtgg 


agcccgaaca 


gcgaggccga 


MZ U 


ggggtcgccg 


gtatgctgct 


gcgggcgttg 


ccggcgggtt 


tattgctcgt 


gatgatcgtc 


/jol) 


cgacagattc 


caacgggaat 


ctggtggatg 


cgcatcttca 


tcctcggcgc 


acttaatatt 


1 A A f\ 


tcgctattct 


ggagcttgtt 


gtttatttcg 


gtctaccgcc 


tgccgggcgg 


ggtcgcggcg 


75 00 


acggtaggcg 


ctgtgcagcc 


gctgatggtc 


gtgttcatct 


ctgccgctct 


gctaggtagc 


1 C ^ A 

/boO 


ccgatacgat 


tgatggcggt 


cctgggggct 


atttgcggaa 


ctgcgggcgt 


ggcgctgttg 


n c o n 
7oz0 


gtgttgacac 


caaacgcagc 


gctagatcct 


gtcggcgtcg 


cagcgggcct 


ggcgggggcg 


n c o n 


gtttccatgg 


cgttcggaac 


cgtgctgacc 


cgcaagtggc 


aacctcccgt 


gcctctgctc 


1 "7 /I C\ 

/ / 4 U 


acctttac.cg 


cctggcaact 


ggcggccgga 


ggacttctgc 


tcgttccagt 


agctttagtg 


t o n c\ 
/oOO 


tttgatccgc 


caatcccgat 


gcctacagga 


accaatgttc 


tcggcctggc 


gtggctcggc 


/ obu 


ctgatcggag 


cgggtttaac 


ctacttcctt 


tggttccggg 


ggatctcgcg 


actcgaacc t 


/ y z u 


acagttgttt 


ccttactggg 


ctttctcagc 


cccagatctg 


gggtcgatca 


gccggggatg 


Toon 

/ y ou 


catcaggccg 


acagtcggaa 


cttcgggtcc 


ccgacctgta 


ccattcggtg 


agcaatggat 


onyifi 
oU4 U 


aggggagttg 


atatcgtcaa 


cgttcacttc 


taaagaaata 


gcgccactca 


gcttcctcag 


olUU 


cggctttatc 


cagcgatttc 


ctattatgtc 


ggcatagttc 


tcaagatcga 


cagcctgtca 


oJ-bll 


cggttaagcg 


agaaatgaat 


aagaaggctg 


ataattcgga 


tctctgcgag 


ggagatgata 


oZZ u 


tttgatcaca 


ggcagcaacg 


ctctgtcatc 


gttacaatca 


acatgctacc 


ctccgcgaga 


8280 


tcatccgtgt 


ttcaaacccg 


gcagcttagt 


tgccgttctt 


ccgaatagca 


tcggtaacat 


8340 


gagcaaagtc 


tgccgcctta 


caacggctct 


cccgctgacg 


ccgtcccgga 


ctgatgggct 


8400 


gcctgtatcg 


agtggtgatt 


ttgtgccgag 


ctgccggtcg 


gggagctgtt 


ggctggctgg 


8460 
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tggcaggata 


tattgtggtg 


taaacaaatt 


gacgcttaga 


caacttaata 


acacattgcg 


8520 


gacgttttta 


atgtactggg 


gtggtttttc 


ttttcaccag 


tgagacgggc 


aacagctgat 


8580 


tgcccttcac 


cgcctggccc 


tgagagagtt 


gcagcaagcg gtccacgctg 


gtttgcccca 


8640 


gcaggcgaaa 


atcctgtttg 


atggtggttc 


cgaaatcggc 


aaaatccctt 


ataaatcaaa 


8700 


agaatagccc 


gagatagggt 


tgagtgttgt 


tccagtttgg 


aacaagagtc 


cactattaaa 


8760 


gaacgtggac 


tccaacgtca 


aagggcgaaa 


aaccgtctat 


cagggcgatg 


gcccactacg 


8820 


tgaaccatca 


cccaaatcaa 


gttttttggg 


gtcgaggtgc 


cgtaaagcac 


taaatcggaa 


8880 


ccctaaaggg 


agcccccgat 


ttagagcttg 


acggggaaag 


ccggcgaacg 


tggcgagaaa 


8940 


ggaagggaag 


aaagcgaaag 


gagcgggcgc 


cattcaggct 


gcgcaactgt 


tgggaagggc 


9000 


gatcggtgcg 


ggcctcttcg 


ctattacgcc 


agctggcgaa 


agggggatgt 


gctgcaaggc 


9060 


gattaagttg 


ggtaacgcca 


gggttttccc 


agtcacgacg 


ttgtaaaacg 


acggccagtg 


9120. 


aattaattcc 


catcttgaaa 


gaaatatagt 


ttaaatattt 


attgataaaa 


taacaagtca 


9180 


ggtattatag 


tccaagcaaa 


aacataaatt 


tattgatgca 


agtttaaatt 


cagaaatatt 


9240 


tcaataactg 


attatatcag 


ctggtacatt 


gccgtagatg* 


aaagactgag 


tgcgatatta 


9300 


tgtgtaatac 


ataaattgat 


gatatagcta 


gcttagctca 


tcgggggatc 


cgtcgaagct 


9360 


agcttgggtc 


ccgctcagaa 


gaactcgtca 


agaaggcgat 


agaaggcgat 


gcgctgcgaa 


9420 


tcgggagcgg 


cgataccgta 


aagcacgagg 


aagcggtcag 


cccattcgcc 


gccaagctct 


9480 


tcagcaatat 


cacgggtagc 


caacgctatg 


tcctgatagc 


ggtccgccac 


acccagccgg 


9540 


ccacagtcga 


tgaatccaga 


aaagcggcca 


ttttccacca 


tgatattcgg 


caagcaggca 


9600 


tcgccatggg 


tcacgacgag 


atcctcgccg 


tcgggcatgc 


gcgccttgag 


cctggcgaac 


9660 


agttcggctg 


gcgcgagccc 


ctgatgctct 


tcgtccagat 


catcctgatc 


gacaagaccg 


9720 


gcttccatcc 


gagtacgtgc 


tcgctcgatg 


cgatgtttcg 


cttggtggtc 


gaatgggcag 


9780 


gtagccggat 


caagcgtatg 


cagccgccgc 


attgcatcag 


ccatgatgga 


tactttctcg 


9840 


gcaggagcaa 


ggtgagatga 


caggagatcc 


tgccccggca 


cttcgcccaa 


tagcagccag 


9900 


tcccttcccg 


cttcagtgac 


aacgtcgagc 


acagctgcgc 


aaggaacgcc 


cgtcgtggcc 


9960 


agccacgata 


gccgcgctgc 


ctcgtcctgc 


agttcattca 


gggcaccgga 


caggtcggtc 


10020 


ttgacaaaaa 


gaaccgggcg 


cccctgcgct 


gacagccgga 


acacggcggc 


atcagagcag 


10080 


ccgattgtct 


gttgtgccca 


gtcatagccg 


aatagcctct 


ccacccaagc 


ggccggagaa 


10140 


cctgcgtgca 


atccatcttg 


ttcaatccaa 


gctcccatgg 


gccctcgact 


agagtcgaga 


10200 
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tctggattga gagtgaatat gagactctaa ttggataccg aggggaattt atggaacgtc 10260 

agtggagcat ttttgacaag aaatatttgc tagctgatag tgaccttagg cgacttttga 10320 

acgcgcaata atggtttctg acgtatgtgc ttagctcatt aaactccaga aacccgcggc 10380 

tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 

gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 

gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 

cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 

gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 

ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740 

actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 

gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 

attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 

tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 

tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 

aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 11100 

attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160 

ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220 

tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 

taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 

tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 

aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460 

ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 

agccagccca ccgcggtggg cggccgcctg cagtctagaa ggcctcctgc tttaatgaga 11580 

tatgcgagac gcctatgatc gcatgatatt tgctttcaat tctgttgtgc acgttgtaaa 11640 

aaacctgagc atgtgtagct cagatcctta ccgccggttt cggttcattc taatgaatat 11700 

atcacccgtt actatcgtat tcttatgaac aatattctce gttcaattta ctgattgtcc 11760 

gtcgagcaaa tttacacatt gccactaaac gtctaaaccc ttgtaatttg tttttgtttt 11820 

actatgtgtg ttatgtattt gatttgcgat aaatttttat atttggtact aaatttataa 11880 
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caccttttat 


gctaacgttt 


gccaacactt 


agcaatttgc 


aagttgatta 


attgattcta 


11940 


aattattttt 


gtcttctaaa 


tacatatact 


aatcaactgg 


aaatgtaaat 


atttgctaat 


12000 


atttctacta 


taggagaatt 


aaagtgagtg 


aatatggtac 


cacaaggttt 


ggagatttaa 


12060 


ttgttgcaat 


gctgcatgga 


tggcatatac 


accaaacatt 


caataattct 


tgaggataat 


12120 


aatggtacca 


cacaagattt 


gaggtgcatg 


aacgtcacgt 


ggacaaaagg 


tttagtaatt 


12180 


tttcaagaca 


acaatgttac 


cacacacaag 


ttttgaggtg 


catgcatgga 


tgccctgtgg 


12240 


aaagtttaaa 


aatattttgg 


aaatgatttg 


catggaagcc 


atgtgtaaaa 


ccatgacatc 


12300 


cacttggagg 


atgcaataat 


gaagaaaact 


acaaatttac 


atgcaactag 


ttatgcatgt 


12360 


agtctatata 


atgaggattt 


tgcaatactt 


tcattcatac 


acactcacta 


agttttacac 


12420 


gattataatt 


tcttcatagc 


cagcggatcc 


gatatcgggc 


ccgctagcgt 


taaccctgct 


12480 


ttaatgagat 


atgcgagacg. 


cctatgatcg 


catgatattt 


gctttcaatt 


ctgttgtgca 


12540 


cgttgtaaaa 


aacctgragca 


tgtgtagctc 


agatccttac 


cgccggtttc 


ggttcattct 


12600 


aatga'atata 


tcacccgtta 


ctatcgtatt 


tttatgaata 


atattctccg 


ttcaatttac 


12660 


tgattgtccg 


tcgacgaatt 


cgagctcggc 


gcgcctctag* 


aggatcgatg 


aattcagatc 


12720 


ggctgagtgg 


ctccttcaac 


gttgcggttc 


tgtcagttcc 


aaacgtaaaa 


cggcttgtcc 


12780 


cgcgtcatcg 


gcgggggtca 


taacgtgact 


cccttaattc 


tccgctcatg 


atcagattgt 


12840 


cgtttcccgc 


cttcagttta 


aactatcagt 


gtttgacagg 


atatattggc 


gggtaaacct 


12900 


aagagaaaag 


agcgtttatt 


agaataatcg 


gatatttaaa 


agggcgtgaa 


aaggtttatc 


12960 


cttcgtccat 


ttgtatgtgc 


atgccaacca 


cagggttccc 


ca 




13002 



<210> 31 

<211> 13905 

<212> DNA 

<213> artificial sequence 
<220> 

<221> misc_f eature 

<222> (1) . . (12085) 

<223> plant expression vector with three promoter- terminator expression 
cassettes 

<400> 31 

gatctggcgc cggccagcga gacgagcaag attggccgcc gcccgaaacg atccgacagc 60 
gcgcccagca caggtgcgca ggcaaattgc accaacgcat acagcgccag cagaatgcca 120 



tagtgggcgg tgacgtcgtt cgagtgaacc agatcgcgca ggaggcccgg cagcaccggc 180 
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ataatcaggc cgatgccgac agcgtcgagc gcgacagtgc tcagaattac gatcaggggt 240 

atgttgggtt tcacgtctgg cctccggacc agcctccgct ggtccgattg aacgcgcgga 300 

ttctttatca ctgataagtt ggtggacata ttatgtttat cagtgataaa gtgtcaagca 360 

tgacaaagtt gcagccgaat acagtgatcc gtgccgccct ggacctgttg aacgaggtcg 420 

gcgtagacgg tctgacgaca cgcaaactgg cggaacggtt gggggttcag cagccggcgc 480 

tttactggca cttcaggaac aagcgggcgc tgctcgacgc actggccgaa gccatgctgg 540 

cggagaatca tacgcattcg gtgccgagag ccgacgacga ctggcgctca tttctgatcg 600 

ggaatgcccg cagcttcagg caggcgctgc tcgcctaccg cgatggcgcg cgcatccatg 660 

ccggcacgcg accgggcgca ccgcagatgg aaacggccga cgcgcagctt cgcttcctct 720 

gcgaggcggg tttttcggcc ggggacgccg tcaatgcgct gatgacaatc agctacttca 780 

ctgttggggc cgtgcttgag gagcaggccg gcgacagcga tgccggcgag cgcggcggca 840 

ccgttgaaca ggctccgctc tcgccgctgt tgcgggccgc gatagacgcc ttcgacgaag 9.00 

ccggtccgga cgcagcgttc gagcagggac tcgcggtgat tgtcgatgga ttggcgaaaa 960 

ggaggctcgt tgtcaggaac gttgaaggac cgagaaaggg tgacgattga tcaggaccgc 1020 

tgccggagcg caacccactc actacagcag agccatgtag acaacatccc ctcccccttt 1080 

ccaccgcgtc agacgcccgt agcagcccgc tacgggcttt ttcatgccct gccctagcgt 1140 

ccaagcctca cggccgcgct cggcctctct ggcggccttc tggcgctctt ccgcttcctc 1200 

gctcactgac tcgctgcgct cggtcgttcg gctgcggcga gcggtatcag ctcactcaaa 1260 

ggcggtaata cggttatcca cagaatcagg ggataacgca ggaaagaaca tgtgagcaaa 1320 

aggccagcaa aaggccagga accgtaaaaa ggccgcgttg ctggcgtttt tccataggct 1380 

ccgcccccct gacgagcatc acaaaaatcg acgctcaagt cagaggtggc gaaacccgac 1440 

aggactataa agataccagg cgtttccccc tggaagctcc ctcgtgcgct ctcctgttcc ' 1500 

gaccctgccg cttaccggat acctgtccgc ctttctccct tcgggaagcg tggcgctttt 1560 

ccgctgcata accctgcttc ggggtcatta tagcgatttt ttcggtatat ccatcctttt 1620 

tcgcacgata tacaggattt tgccaaaggg ttcgtgtaga ctttccttgg tgtatccaac 1680 

ggcgtcagcc gggcaggata ggtgaagcag gcccacccgc gagcgggtgt tccttcttca 1740 

ctgtccctta ttcgcacctg gcggtgctca acgggaatcc tgctctgcga ggctggccgg 1800 

ctaccgccgg cgtaacagat gagggcaagc ggatggctga tgaaaccaag ccaaccagga 1860 
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agggcagccc acctatcaag gtgtactgcc ttccagacga acgaagagcg attgaggaaa 1920 

aggcggcggc ggccggcatg agcctgtcgg cctacctgct ggccgtcggc cagggctaca 1980 

aaatcacggg cgtcgtggac tatgagcacg tccgcgagct ggcccgcatc aatggcgacc 2040 

tgggccgcct gggcggcctg ctgaaactct ggctcaccga cgacccgcgc acggcgcggt 2100 

tcggtgatgc cacgatcctc gccctgctgg cgaagatcga agagaagcag gacgagcttg 2160 

gcaaggtcat gatgggcgtg gtccgcccga gggcagagcc atgacttttt tagccgctaa 2220 

aacggccggg gggtgcgcgt gattgccaag cacgtcccca tgcgctccat caagaagagc 2280 

gacttcgcgg agctggtgaa gtacatcacc gacgagcaag gcaagaccga gcgcctttgc 2340 

gacgctcacc gggctggttg . ccctcgccgc tgggctggcg gccgtctatg gccctgcaaa 2400 

cgcgccagaa acgccgtcga agccgtgtgc gagacaccgc ggccgccggc gttgtggata 2460 

cctcgcggaa aacttggccc tcactgacag atgaggggcg gacgttgaca cttgaggggc 2520 

cgactcaccc ggcgcggcgt tgacagatga ggggcaggct cgatttcggc cggcgacgtg 2580 

gagctggcca gcctcgcaaa tcggcgaaaa cgcctgattt tacgcgagtt tcccacagat 2 640 

gatgtggaca agcctgggga taagtgccct gcggtattga' cacttgaggg gcgcgactac 2700 

tgacagatga ggggcgcgat ccttgacact tgaggggcag agtgctgaca gatgaggggc 2760 

gcacctattg acatttgagg ggctgtccac aggcagaaaa tccagcattt gcaagggttt 2820 

ccgcccgttt ttcggccacc gctaacctgt cttttaacct gcttttaaac caatatttat 2880 

aaaccttgtt tttaaccagg gctgcgccct gtgcgcgtga ccgcgcacgc cgaagggggg 2940 

tgccccccct tctcgaaccc tcccggcccg ctaacgcggg cctcccatcc ccccaggggc 3000 

tgcgcccctc ggccgcgaac ggcctcaccc caaaaatggc agcgctggca gtccttgcca 3060 

ttgccgggat cggggcagta acgggatggg cgatcagccc gagcgcgacg cccggaagca 3120 

ttgacgtgcc gcaggtgctg gcatcgacat tcagcgacca ggtgccgggc agtgagggcg 3180 

gcggcctggg tggcggcctg cccttcactt cggccgtcgg ggcattcacg gacttcatgg 3240 

cggggccggc aatttttacc ttgggcattc ttggcatagt ggtcgcgggt gccgtgctcg 33 00 

tgttcggggg tgcgataaac ccagcgaacc atttgaggtg ataggtaaga ttataccgag 3360 

gtatgaaaac gagaattgga cctttacaga attactctat gaagcgccat atttaaaaag 3420 

ctaccaagac gaagaggatg aagaggatga ggaggcagat tgccttgaat atattgacaa 3480 

tactgataag ataatatatc ttttatatag aagatatcgc cgtatgtaag gatttcaggg 3540 

ggcaaggcat aggcagcgcg cttatcaata tatctataga atgggcaaag cataaaaact 3600 
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tgcatggact aatgcttgaa acccaggaca ataaccttat agcttgtaaa ttctatcata 3 660 

attgggtaat gactccaact tattgatagt gttttatgtt cagataatgc ccgatgactt 3720 

tgtcatgcag ctccaccgat tttgagaacg acagcgactt ccgtcccagc cgtgccaggt 3780 

gctgcctcag attcaggtta tgccgctcaa ttcgctgcgt atatcgcttg ctgattacgt 3 840 

gcagctttcc cttcaggcgg gattcataca gcggccagcc atccgtcatc catatcacca 3900 

cgtcaaaggg tgacagcagg ctcataagac gccccagcgt cgccatagtg cgttcaccga 3960 

atacgtgcgc aacaaccgtc ttccggagac tgtcatacgc gtaaaacagc cagcgctggc 402 0 

gcgatttagc cccgacatag ccccactgtt cgtccatttc cgcgcagacg atgacgtcac 4080 

tgcccggctg tatgcgcgag gttaccgact gcggcctgag ttttttaagt gacgtaaaat 4140 

cgtgttgagg ccaacgccca taatgcgggc tgttgcccgg catccaacgc cattcatggc 4200 

catatcaatg attttctggt gcgtaccggg ttgagaagcg gtgtaagtga actgcagttg 4260 
ccatgtttta cggcagtgag agcagagata gcgctgatgt ccggcggtgc ttttgccgtt . 4320 

acgcaccacc ccgtcagtag ctgaacagga gggacagctg atagacacag aagccactgg 4380 

agcacctcaa aaacaccatc atacactaaa tcagtaagtt ggcagcatca cccataattg 4440 

tggtttcaaa atcggctccg tcgatactat gttatacgcc aactttgaaa acaactttga 4500 

aaaagctgtt ttctggtatt taaggtttta gaatgcaagg aacagtgaat tggagttcgt 4560 

cttgttataa ttagcttctt ggggtatctt taaatactgt agaaaagagg aaggaaataa 4620 

taaatggcta aaatgagaat atcaccggaa ttgaaaaaac tgatcgaaaa ataccgctgc 4680 

gtaaaagata cggaaggaat gtctcctgct aaggtatata agctggtggg agaaaatgaa 4740 

aacctatatt taaaaatgac ggacagccgg tataaaggga ccacctatga tgtggaacgg 4800 

gaaaaggaca tgatgctatg gctggaagga aagctgcctg ttccaaaggt cctgcacttt 4860 

gaacggcatg atggctggag caatctgctc. atgagtgagg ccgatggcgt cctttgctcg 4920 

gaagagtatg aagatgaaca aagccctgaa aagattatcg agctgtatgc ggagtgcatc 4980 

aggctctttc actccatcga catatcggat tgtccctata cgaatagctt agacagccgc 5040 

ttagccgaat tggattactt actgaataac gatctggccg atgtggattg cgaaaactgg 5100 

gaagaagaca ctccatttaa agatccgcgc gagctgtatg attttttaaa gacggaaaag 5160 

cccgaagagg aacttgtctt ttcccacggc gacctgggag acagcaacat ctttgtgaaa 5220 

gatggcaaag taagtggctt tattgatctt gggagaagcg gcagggcgga caagtggtat 5280 



94 

gacattgcct tctgcgtccg gtcgatcagg gaggatatcg gggaagaaca, gtatgtcgag 5340 

ctattttttg acttactggg gatcaagcct gattgggaga aaataaaata ttatatttta 5400 

ctggatgaat tgttttagta cctagatgtg gcgcaacgat gccggcgaca- agcaggagcg 5460 

caccgacttc ttccgcatca agtgttttgg ctctcaggcc gaggcccacg gcaagtattt 5520 

gggcaagggg tcgctggtat tcgtgcaggg caagattcgg aataccaagt acgagaagga 5580 

cggccagacg gtctacggga ccgacttcat tgccgataag gtggattatc tggacaccaa 5640 

ggcaccaggc gggtcaaatc aggaataagg gcacattgcc ccggcgtgag tcggggcaat 5700 

cccgcaagga gggtgaatga atcggacgtt tgaccggaag gcatacaggc aagaactgat 5760 

cgacgcgggg ttttccgccg aggatgccga aaccatcgca agccgcaccg tcatgcgtgc 5820 

gccccgcgaa. accttccagt ccgtcggctc gatggtccag caagctacgg ccaagatcga 5880 

gcgcgacagc gtgcaactgg ctccccctgc cctgcccgcg ccatcggccg ccgtggagcg 5940 

ttcgcgtcgt ctcgaacagg aggcggcagg. tttggcgaag tcgatgacca tcgacacgcg 6000 

aggaactatg acgaccaaga agcgaaaaac cgccggcgag gacctggcaa aacaggtcag 6060 

cgaggccaag caggccgcgt tgctgaaaca cacgaagcag cagatcaagg aaatgcagct 612 0 

ttccttgttc gatattgcgc cgtggccgga cacgatgcga gcgatgccaa acgacacggc 6180 

ccgctctgcc ctgttcacca cgcgcaacaa gaaaatcccg cgcgaggcgc tgcaaaacaa 6240 

ggtcattttc cacgtcaaca aggacgtgaa gatcacctac accggcgtcg agctgcgggc 63 00 

cgacgatgac gaactggtgt ggcagcaggt gttggagtac gcgaagcgca cccctatcgg 6360 

cgagccgatc accttcacgt tctacgagct ttgccaggac ctgggctggt cgatcaatgg 6420 

ccggtattac acgaaggccg aggaatgcct gtcgcgccta caggcgacgg cgatgggctt 6480 

cacgtccgac cgcgttgggc acctggaatc ggtgtcgctg ctgcaccgct tccgcgtcct 6540 

ggaccgtggc aagaaaacgt cccgttgcca ggtcctgatc gacgaggaaa tcgtcgtgct 6600 

gtttgctggc gaccactaca cgaaattcat atgggagaag taccgcaagc tgtcgccgac 6660 

ggcccgacgg atgttcgact atttcagctc gcaccgggag ccgtacccgc tcaagctgga 6720 

aaccttccgc ctcatgtgcg gatcggattc cacccgcgtg aagaagtggc gcgagcaggt 6780 

cggcgaagcc Lgcgaagagt tgcgaggcag cggcctggtg gaacacgcct gggtcaatga 6840 

tgacctggtg cattgcaaac gctagggcct tgtggggtca gttccggctg ggggttcagc 6900 

agccagcgct ttactggcat ttcaggaaca agcgggcact gctcgacgca cttgcttcgc 6960 
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tcagtatcgc 


tcgggacgca 


cggcgcgctc 


tacgaactgc 


cgataaacag 


aggattaaaa 


7020 


ttgacaattg 


tgattaaggc 


tcagattcga 


cggcttggag 


cggccgacgt 


gcaggatttc 


7080 


cgcgagatcc 


gattgtcggc 


cctgaagaaa 


gctccagaga 


tgttcgggtc 


cgtttacgag 


7140 


cacgaggaga 


aaaagcccat 


ggaggcgttc 


gctgaacggt 


tgcgagatgc 


cgtggcattc 


7200 


ggcgcctaca 


tcgacggcga 


gatcattggg 


ctgtcggtct 


tcaaacagga 


ggacggcccc 


7260 


aaggacgctc 


acaaggcgca 


tctgtccggc 


gttttcgtgg 


agcccgaaca 


gcgaggccga 


7320 


ggggtcgccg 


gtatgctgct 


gcgggcgttg 


ccggcgggtt 


tattgctcgt 


gatgatcgtc 


7380 


cgacagattc 


caacgggaat 


ctggtggatg 


cgcatcttca 


tcctcggcgc 


acttaatatt 


7440 


tcgctattct 


ggagcttgtt 


gtttatttcg 


gtctaccgcc 


tgccgggcgg 


ggtcgcggcg 


7500 


acggtaggcg 


ctgtgcagcc 


gctgatggtc 


gtgttcatct 


ctgccgctct 


gctaggtagc 


7560 


ccgatacgat 


tgatggcggt 


cctgggggct 


atttgcggaa 


ctgcgggcgt 


ggcgctgttg 


7620 


gtgttgacac 


caaacgcagc 


gctagatcct 


gtcggcgtcg 


cagcgggcct 


ggcgggggcg 


7680 


gtttccatgg 


cgttcggaac 


cgtgctgacc 


cgcaagtggc 


aacctcccgt 


gcctctgctc 


7740 


acctttaccg 


cctggcaact 


ggcggccgga 


ggacttctgc 


tcgttccagt 


agctttagtg 


7800 


t.ttgatccgc 


caatcccgat 


gcctacagga 


accaatgttc 


tcggcctggc 


gtggctcggc 


7860 


ctgatcggag 


cgggtttaac 


ctacttcctt 


tggttccggg 


ggatctcgcg 


actcgaacct 


7920 


acagttgttt 


ccttactggg 


ctttctcagc 


cccagatctg 


gggtcgatca 


gccggggatg 


*7 Q q n 

/ y ou 


catcaggccg 


acagtcggaa 


cttcgggtcc 


ccgacctgta 


ccattcggtg 


agcaatggat 


oU4 U 


aggggagttg 


atatcgtcaa 


cgttcacttc 


taaagaaata 


gcgccactca 


gcttcctcag 


8100 


cggctttatc 


cagcgatttc 


ctattatgtc 


ggcatagttc 


tcaagatcga 


cagcctgtca 


8160 


cggttaagcg 


agaaatgaat 


aagaaggctg 


ataattcgga 


tctctgcgag 


ggagatgata 


8220 


tttgatcaca 


ggcagcaacg 


ctctgtcatc 


gttacaatca 


acatgctacc 


ctccgcgaga 


8280 


tcatccgtgt 


ttcaaacccg 


gcagcttagt 


tgccgttctt 


ccgaatagca 


tcggtaacat 


8340 


gagcaaagtc 


tgccgcctta 


caacggctct 


cccgctgacg 


ccgtcccgga 


ctgatgggct 


8400 


gcctgtatcg 


agtggtgatt 


ttgtgccgag 


ctgccggtcg 


gggagctgtt 


ggctggctgg 


8460 


cggcaggata 


tattgtggtg 


taaacaaatt 


gacgcttaga 


caacttaata 


acacattgcg" 


8520 


gacgttttta 


atgtactggg 


gtggtttttc 


ttttcaccag 


tgagacgggc 


aacagctgat 


8580 


tgcccttcac 


cgcctggccc 


tgagagagtt 


gcagcaagcg 


gtccacgctg 


gtttgcccca 


8640 
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gcaggcgaaa 


atcctgtttg 


atggtggttc 


cgaaatcggc 


aaaatccctt 


ataaatcaaa 


8700 


agaatagccc 


gagatagggt 


tgagtgttgt 


tccagtttgg 


aaca.agagtc 


cactattaaa 


8760 


gaacgtggac 


tccaacgtca 


aagggcgaaa 


aaccgtctat 


cagggcgatg 


gcccactacg 


8820 


tgaaccatca 


cccaaatcaa 


gttttttggg 


gtcgaggtgc 


cgtaaagcac 


taaatcggaa 


8880 


ccctaaaggg 


agcccccgat 


ttagagcttg 


acggggaaag 


ccggcgaacg 


tggcgagaaa 


8940 


ggaagggaag 


aaagcgaaag 


gagegggcgc 


cattcaggct 


gcgcaactgt 


tgggaagggc 


9000 


gatcggtgcg 


ggcctcttcg 


ctattacgcc 


agctggcgaa 


agggggatgt 


gctgcaaggc 


9060 


gattaagttg 


ggtaacgcca 


gggttttccc 


agtcacgacg 


ttgtaaaacg 


acggccagtg 


9120 


aattaattcc 


catcttgaaa . 


gaaatatagt 


ttaaatattt 


attgataaaa 


taacaagtca 


9180 


ggtattatag 


tccaagcaaa 


aacataaatt 


tattgatgca 


agtttaaatt 


cagaaatatt 


9240 


tcaataactg 


attatatcag 


ctggtacatt 


gccgtagatg 


aaagactgag 


tgcgatatta 


9300 


tgtgtaatac 


ataaattgat 


gatatagcta 


gcttagctca 


tcgggggatc 


cgtcgaagct 


9360 


agcttgggtc 


ccgctcagaa 


gaactcgtca 


agaaggcgat 


agaaggcgat 


gcgctgcgaa 


9420 


tcgggagcgg 


cgataccgta 


aagcacgagg 


aagcggtcag* 


cccattcgcc 


gccaagctct 


9480 


tcagcaatat 


cacgggtagc 


caacgctatg 


tcctgatagc 


ggtccgccac 


acccagccgg 


9540 


ccacagtcga 


tgaatccaga 


aaagcggcca 


ttttccacca 


tgatattcgg 


caagcaggca 


9600 


tcgccatggg 


tcacgacgag 


atcctcgccg 


tcgggcatgc 


gcgccttgag 


cctggcgaac 


9660 


agttcggctg 


gcgcgagccc 


ctgatgctct 


tcgtccagat 


catcctgatc 


gacaagaccg 


9720 


gcttccatcc 


gagtacgtgc 


tcgctcgatg 


cgatgtttcg 


cttggtggtc 


gaatgggcag 


9780 


gtagccggat 


caagcgtatg 


cagccgccgc 


attgcatcag 


ccatgatgga 


tactttctcg 


9840 


gcaggagcaa 


ggtgagatga 


caggagatcc 


tgccccggca 


cttcgcccaa 


tagcagccag 


9900 


tcccttcccg 


cttcagtgac 


aacgtcgagc 


acagctgcgc 


aaggaacgcc 


cgtcgtggcc 


9960 


agccacgata 


gccgcgctgc 


ctcgtcctgc 


agttcattca 


gggcaccgga 


caggtcggtc 


10020 


ttgacaaaaa 


gaaccgggcg 


cccctgcgct 


gacagccgga 


acacggcggc 


atcagagcag 


10080 


ccgattgtct 


gttgtgccca 


gtcatagccg 


aatagcctct 


ccacccaagc 


ggccggagaa 


10140 


cctgcgtgca 


atccatcttg 


ttcaatccaa 


gctcccatgg 


gccctcgact 


agagtcgaga 


10200 


tctggattga 


gagtgaatat 


gagactctaa 


ttggataccg 




atggaacgtc 


10260 


agtggagcat 


ttttgacaag 


aaatatttgc 


tagctgatag 


tgaccttagg 


cgacttttga 


10320 


acgcgcaata 


atggtttctg 


acgtatgtgc 


ttagctcatt 


aaactccaga 


aacccgcggc 


10380 
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tgagtggctc cttcaacgtt gcggttctgt cagttccaaa cgtaaaacgg cttgtcccgc 10440 

gtcatcggcg ggggtcataa cgtgactccc ttaattctcc gctcatgatc ttgatcccct 10500 

gcgccatcag atccttggcg gcaagaaagc catccagttt actttgcagg gcttcccaac 10560 

cttaccagag ggcgccccag ctggcaattc cggttcgctt gctgtccata aaaccgccca 10620 

gtctagctat cgccatgtaa gcccactgca agctacctgc tttctctttg cgcttgcgtt 10680 

ttcccttgtc cagatagccc agtagctgac attcatccgg ggtcagcacc gtttctgcgg 10740 

actggctttc tacgtgttcc gcttccttta gcagcccttg cgccctgagt gcttgcggca 10800 

gcgtgaagct tgcatgcctg caggtcgacg gcgcgccgag ctcctcgagc aaatttacac 10860 

attgccacta aacgtctaaa cccttgtaat ttgtttttgt tttactatgt gtgttatgta 10920 

tttgatttgc gataaatttt tatatttggt actaaattta taacaccttt tatgctaacg 10980 

tttgccaaca cttagcaatt tgcaagttga ttaattgatt ctaaattatt tttgtcttct 11040 

aaatacatat actaatcaac tggaaatgta aatatttgct aatatttcta ctataggaga 111,00 

attaaagtga gtgaatatgg taccacaagg tttggagatt taattgttgc aatgctgcat 11160 

ggatggcata tacaccaaac attcaataat tcttgaggat aataatggta ccacacaaga 11220 

tttgaggtgc atgaacgtca cgtggacaaa aggtttagta atttttcaag acaacaatgt 11280 

taccacacac aagttttgag gtgcatgcat ggatgccctg tggaaagttt aaaaatattt 11340 

tggaaatgat ttgcatggaa gccatgtgta aaaccatgac atccacttgg aggatgcaat 11400 

aatgaagaaa actacaaatt tacatgcaac tagttatgca tgtagtctat ataatgagga 11460 

ttttgcaata ctttcattca tacacactca ctaagtttta cacgattata atttcttcat 11520 

agccagccca ccgcggtggg cggccgcctg cagtctagaa ggcctcctgc tttaatgaga 11580 

tatgcgagac gcctatgatc gcatgatatt tgctttcaat tctgttgtgc acgttgtaaa 11640 

aaacctgagc atgtgtagct cagatcctta ccgccggttt cggttcattc taatgaatat 11700 

atcacccgtt actatcgtat ttttatgaat aatattctcc gttcaattta ctgattgtcc 11760 

gtcgagcaaa tttacacatt gccactaaac gtctaaaccc ttgtaatttg tttttgtttt 11820 

actatgtgtg ttatgtattt gatttgcgat aaatttttat atttggtact aaatttataa 11880 



caccttttat gctaacgttt gccaacactt agcaatttgc aagttgatta attgattcta 
aattattttt gtcttctaaa tacatatact aatcaactgg aaatgtaaat atttgctaat 
atttctacta taggagaatt aaagtgagtg aatatggtac cacaaggttt ggagatttaa 



11940 
12000 
12060 
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ttgttgcaat 


gctgcatgga 


tggcatatac 


accaaacatt 


caataattct . 


tgaggataat 


1ZXZU 


aatggtacca 


cacaagattt 


gaggtgcatg 


aacgtcacgt 


ggacaaaagg 


tttagtaatt 


1 a 1 o a 
1218U 


tttcaagaca 


acaatgttac 


cacacacaag 


ttttgaggtg 


catgcatgga 


tgccctgtgg 


12240 


aaagtttaaa 


aatattttgg 


aaatgatttg 


catggaagcc 


atgtgtaaaa 


ccatgacatc 


12300 


cacttggagg 


atgcaataat 


gaagaaaact. 


acaaatttac 


atgcaactag 


ttatgcatgt 


1 A "3 £ A 
123 60 


agtctatata 


atgaggattt 


tgcaatactt 


tcattcatac 


acactcacta 


agttttacac 


1 a ^ o a 
1242 0 


gattataatt. 


tcttcatagc 


cagcggatcc 


gatatcgggc 


ccgctagcgt 


taaccctgct 


1 O /I Oft 

12480 


ttaatgagat 


atgcgagacg 


cctatgatcg 


catgatattt 


gctttcaatt 


ctgttgtgca 


T A C /I A 

12 S4 0 


cgttgtaaaa 


aacctgagca 


tgtgtagctc 


agatccttac 


cgccggtttc 


ggttcattct 


12600 


aatgaatata. 


tcacccgtta 


ctatcgtatt 


tttatgaata. 


atattctccg 


ttcaatttac 


12660 


tgattgtccg 


tcgagcaaat 


ttacacattg 


ccactaaacg tctaaaccct 


tgtaatttgt 


12720 


ttttgtttta 


ctatgtgtgt 


tatgtatttg 


att toccata 


aatttttata 


tttggtacta 


12780 


aatttataac 


accttttatg 


ctaacgtttg 


ccaacactta 


gcaatt tgca 


agttgattaa 


12840 


ttgattctaa 


attatttttg 


tcttctaaat 


acatatacta 


atcaactgga 


aatgtaaata 


12900 


tttgctaata 


tttctactat 


aggagaatta 


aagtgagtga 


atatggtacc 


acaaggtttg 


12960 


gagatttaat 


tgttgcaatg 


ctgcatggat 


ggcatataca 


ccaaacattc 


aataattctt 


1 O A A A 

13 02 0 


gaggataata 


atggtaccac 


acaagatttg 


aggtgcatga 


acgtcacgtg 


gacaaaaggt 


1 ~) A O A 

13 080 


ttagtaattt 


ttcaagacaa 


caatgttacc 


acacacaagt 


tttgaggtgc 


atgcatggat 


1 O 1 /I A 

13140 


gccctgtgga 


aagtttaaaa 


atattttgga 


aatgatttgc 


atggaagcca 


tgtgtaaaac 


1 "3 A A A 

13200 


catgacatcc 


acttggagga 


tgcaataatg 


aagaaaacta 


caaatttaca 


tgcaactagt 


i o a a 

13260 


tatgcatgta 


gtctatataa 


tgaggatttt 


gcaatacttt 


cattcataca 


cactcactaa 


"1 O "5 A A 

13320 


gttttacacg 


attataattt 


cttcatagcc 


agcagatctg 


ccggcatcga 


tcccgggcca 


1 O O O A 

133 80 


tggcctgctt 


taatgagata 


tgcgagacgc 


ctatgatcgc 


atgatatttg 


ctttcaattc 


1 O /I /I A 

13440 


tgttgtgcac 


gttgtaaaaa 


acctgagcat 


gtgtagctca 


gatccttacc 


gccggtttcg 


1 T C A A 

13 500 


gttcattcta 


atgaatatat 


cacccgttac 


tatcgtattt 


ttatgaataa 


tattctccgt 


13560 


tcaatttact 


gattgtccgL 


cgacgagctc 


ggcgcgcctc 


tagaggatcg 


atgaattcag 


13620 


atcggctgag 


tggctccttc 


aacgttgcgg 


ttctgtcagt 


tccaaacgta 


aaacggcttg 


13680 


tcccgcgtca 


tcggcggggg 


tcataacgtg 


actcccttaa 


ttctccgctc 


atgatcagat 


13740 
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tgtcgtttcc cgccttcagt ttaaactatc agtgtttgac aggatatatt ggcgggtaaa 13800 
cctaagagaa aagagcgttt attagaataa tcggatattt aaaagggcgt gaaaaggttt 13860 
atccttcgtc catttgtatg tgcatgccaa ccacagggtt cccca 13905 



<210> 32 

<211> 1443 

<212> DNA 

<213> Phaeodactylum tricornutum 
<220> 

<221> CDS 

<222> (9) . . (1442) 

<223> delta-6-desaturase 



<400> 32 

gatctaaa atg ggc aaa gga ggg gac get egg gee teg aag ggc tea acg 50 
Met Gly Lys Gly Gly Asp Ala Arg Ala Ser Lys Gly Ser Thr 
1 5 - 10 

gcg get cgc aag ate agt tgg cag gaa gtc aag acc cac gcg tct ccg .98 
Ala Ala Arg Lys lie Ser Trp Gin Glu Val Lys Thr His Ala Ser Pro 
15 20 25 30 

gag gac gee tgg ate att cac tec aat aag gtc tac gac gtg tec aac 146 
Glu Asp Ala Trp lie lie His Ser Asn Lys Val Tyr Asp Val Ser Asn 
35 40 45 

tgg cac gaa cat ccc gga ggc gee gtc att ttc acg cac gec ggt gac 194 
Trp His Glu His Pro Gly Gly Ala Val lie Phe Thr His Ala Gly Asp 
50 55 60 

gac atg acg gac att ttc get gee ttt cac gca ccc gga teg cag teg 242 
Asp Met Thr Asp lie Phe Ala' Ala Phe His Ala Pro Gly Ser Gin Ser 
65 70 75 

etc atg aag aag ttc tac att ggc gaa ttg etc ccg gaa acc acc ggc 290 
Leu Met Lys Lys Phe Tyr lie Gly Glu Leu Leu Pro Glu Thr Thr Gly 
80 85 90 

aag gag ccg cag caa ate gec ttt gaa aag ggc tac cgc gat ctg cgc 338 
Lys Glu Pro Gin Gin lie Ala Phe Glu Lys Gly Tyr Arg Asp Leu Arg 
95 100 105 110 

tec aaa etc ate atg atg ggc atg ttc aag tec aac aag tgg ttc tac 386 
Ser Lys Leu lie Met Met Gly Met Phe Lys Ser Asn Lys Trp Phe Tyr 
115 120 125 

gtc tac aag cgc etc age aac atg gec att tgg gec gec gec tgt get 434 
Val Tyr Lys Cys Leu Ser Asn Met Ala lie Trp Ala Ala Ala Cys Ala 
130 135 140 



etc gtc ttt tac teg gac cgc ttc tgg gta cac ctg gee age gec gtc 
Leu Val Phe Tyr Ser Asp Arg Phe Trp Val His Leu Ala Ser Ala Val 



482 



100 



145 150 155 

atg ctg gga aca ttc ttt cag cag teg gga tgg ttg .gca cac gac ttt 530 
Met Leu Gly'Thr Phe Phe Gin Gin Ser Gly Trp Leu Ala His Asp Phe 
160 165 170 

ctg . cac cac cag gtc ttc acc aag cgc aag cac ggg gat etc gga gga 578 
Leu His His Gin Val Phe Thr Lys Arg Lys His Gly Asp Leu Gly Gly 
175 180 185 190 

etc ttt tgg ggg aac etc atg cag ggt tac tec gta cag tgg tgg aaa 62 6 

Leu Phe Trp Gly Asn Leu Met Gin Gly Tyr Ser Val Gin Trp Trp Lys 
195 200 205 

aac aag cac aac gga cac cac gee gtc ccc aac etc cac tgc tec tec 674 
Asn Lys His Asn Gly His His Ala Val Pro Asn Leu His Cys Ser Ser 
210 215 220 

gca gtc gcg caa gat ggg gac ccg gac ate gat acc atg ccc ctt etc 722 
Ala Val Ala Gin Asp Gly Asp Pro Asp lie Asp Thr Met Pro Leu Leu 
225 230 235 

gee tgg tec gtc cag caa gee cag tct tac egg gaa etc caa gee gac 770 
Ala Trp Ser Val Gin Gin Ala Gin Ser Tyr Arg Glu Leu Gin Ala Asp 
240 245 250 

gga aag gat teg ggt ttg gtc aag ttc atg ate cgt aac caa tec tac 818 
Gly. Lys Asp Ser Gly Leu Val, Lys. Phe Met lie Arg Asn Gin Ser Tyr 
255 260 265 270 

ttt tac ttt ccc ate ttg. ttg etc gee cgc ctg teg tgg ttg aac gag 866 
Phe Tyr Phe Pro lie Leu Leu Leu Ala Arg Leu Ser Trp Leu Asn Glu 
275 280 285 

tec ttc aag tgc gee ttt ggg ctt gga get gcg teg gag aac get get 914 
Ser Phe Lys Cys Ala Phe Gly Leu Gly Ala Ala Ser Glu Asn Ala Ala 
290 295 300 

etc gaa etc aag gee aag ggt ctt cag tac ccc ctt ttg gaa aag get 962 
Leu Glu Leu Lys Ala Lys Gly Leu Gin Tyr Pro Leu Leu Glu Lys Ala 
305 310 315 

ggc ate ctg ctg cac tac get tgg atg ctt aca gtt teg tec ggc ttt 1010 
Gly lie Leu Leu His Tyr Ala Trp Met Leu Thr Val Ser Ser Gly Phe 
320 325 330 

gga cgc ttc teg ttc gcg tac acc gca ttt tac ttt eta acc gcg acc 1058 
Gly Arg Phe Ser Phe Ala Tyr Thr Ala Phe Tyr Phe Leu Thr Ala Thr 
335 340 345 350 

gcg tec tgt gga ttc ttg etc gee att gtc ttt ggc etc ggc cac aac 1106 
Ala Ser Cys Gly Phe Leu Leu Ala lie Val Phe Gly Leu Gly His Asn 
355 360 365 



ggc atg gee acc tac aat gec gac gee cgt ccg gac ttc tgg aag etc 
Gly Met Ala Thr Tyr Asn Ala Asp Ala Arg Pro Asp Phe Trp Lys Leu 
370 375 380 



1154 
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caa gtc acc acg act cgc aac gtc acg ggc gga cac ggt ttc ccc caa 1202 
Gin Val Thr Thr Thr Arg Asn Val Thr Gly Gly His Gly Phe Pro Gin 
385 390 395 

gcc ttt gtc gac tgg ttc tgt ggt ggc etc cag tac caa gtc gac cac 1250 
Ala Phe Val Asp Trp Phe Cys Gly Gly Leu Gin Tyr Gin Val Asp His 
400 405 410 

cac tta ttc ccc age ctg ccc cga cac aat ctg gcc aag aca cac gca 1298 
His Leu Phe Pro Ser Leu Pro Arg His Asn Leu Ala Lys Thr. His Ala 
415 420 425 430 

ctg gtc gaa teg ttc tgc aag gag tgg ggt gtc cag tac cac gaa gcc 1346 
Leu Val Glu Ser Phe Cys Lys Glu Trp Gly Val Gin Tyr His Glu Ala 
435 440 445 

gac ctt gtg gac ggg acc atg gaa gtc . ttg cac cat ttg ggc age gtg 1394 
Asp Leu Val Asp Gly Thr Met Glu Val Leu His His Leu Gly Ser Val 
450 455 460 

gcc ggc gaa ttc gtc gtg gat ttt gta cgc gat gga ccc gcc atg taa a 1443 
Ala Gly Glu Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala Met 
465 470 475 



<210> 33 

<211> 477 

<212> PRT 

<213> Phaeodactylum tricornutum 

<400> 33 

Met Gly Lys Gly Gly Asp Ala Arg Ala Ser Lys Gly Ser Thr Ala Ala 
1 5 10 15 



Arg Lys lie Ser Trp Gin Glu Val Lys Thr His Ala Ser Pro Glu Asp 
20 .25 30 



Ala Trp lie lie His Ser Asn Lys Val Tyr Asp Val Ser Asn Trp His 
35 40 45 



Glu His Pro Gly Gly Ala Val lie Phe Thr His Ala Gly Asp Asp Met 
50 55 60 



Thr Asp lie Phe Ala Ala Phe His Ala Pro Gly Ser Gin Ser Leu Met 
65 70 75 80 



Lys Lys Phe Tyr lie Gly Glu Leu Leu Pro Glu Thr Thr Gly Lys Glu 
85 90 95 
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Pro Gin Gin lie. Ala Phe Glu Lys Gly Tyr Arg Asp Leu Arg Ser Lys 
100 105 110 



Leu lie Met. Met Gly Met Phe Lys Ser Asn Lys Trp Phe Tyr Val Tyr 
115 . 120 125 



Lys Cys Leu Ser Asn Met Ala lie Trp Ala Ala Ala Cys Ala Leu Val 
130 135 140 



Phe Tyr Ser Asp Arg Phe Trp Val His Leu Ala Ser Ala Val Met Leu 
145 . . 150 155 160 



Gly Thr Phe Phe Gin Gin Ser Gly Trp Leu Ala His Asp Phe Leu His 
165 170 175 



His Gin Val Phe Thr Lys Arg Lys His Gly Asp Leu Gly Gly Leu Phe 
180 185 190 



Trp Gly Asn Leu Met Gin Gly Tyr Ser Val Gin Trp Trp Lys Asn Lys 
. ~ 195 200 205 



His Asn Gly His His Ala Val Pro Asn Leu His Cys Ser Ser Ala Val 
210 215 220 



Ala Gin Asp Gly Asp Pro Asp lie Asp Thr Met Pro Leu Leu Ala Trp 
225 230 235 240 



Ser Val Gin Gin Ala Gin Ser Tyr Arg Glu Leu Gin Ala Asp Gly Lys 
245 250 255 



Asp Ser Gly Leu Val Lys Phe Met lie Arg Asn Gin Ser Tyr Phe Tyr 
260 265 270 



Phe Pro lie Leu Leu Leu Ala Arg Leu Ser Trp Leu Asn Glu Ser Phe 
275' 280 285 



Lys Cys Ala Phe Gly Leu Gly Ala Ala Ser Glu Asn Ala Ala Leu Glu 
290 295 300 



Leu Lys Ala Lys Gly Leu Gin Tyr Pro Leu Leu Glu Lys Ala Gly lie 
305 310 315 320 



Leu Leu His Tyr Ala Trp Met Leu Thr Val Ser Ser Gly Phe Gly Arg 
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325 330 335 



Phe Ser Phe Ala Tyr Thr Ala Phe Tyr Phe Leu Thr Ala Thr Ala Ser 
340 345 350 



Cys Gly Phe Leu Leu Ala He Val Phe Gly Leu Gly His Asn Gly Met 
355 360 365 



Ala Thr Tyr Asn Ala Asp Ala Arg Pro Asp Phe Trp Lys Leu Gin Val 
370 375 380 



Thr Thr Thr Arg Asn Val Thr Gly Gly His Gly Phe Pro Gin Ala Phe 
385 390 395 400 



Val Asp Trp Phe Cys Gly Gly Leu Gin Tyr Gin Val Asp His His Leu 
405 410 415 



Phe Pro Ser Leu Pro Arg His Asn Leu Ala Lys Thr His Ala Leu Val 
420 425 430 



Glu Ser Phe Cys Lys Glu Trp Gly Val Gin Tyr His Glu Ala Asp Leu 
435 440 445 



Val Asp Gly Thr Met Glu Val Leu His His Leu Gly Ser Val Ala Gly 
450 455 460 



Glu Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala Met 
465 470 475 



<210> 34 . 

<211> 17061 

<212> DNA 

<213> Phaeodactylum tricornutum, Physcomitrella patens, Caenorhabditis elegans 
<220> 

<221> CDS 

<222> (4554) (5987) 

<223> Phaeodactylum tricornutum delta-6-desaturase 



<220> 

<221> CDS 

<222> (2805) . . (3653) 

<223> Caenorhabditis elegans LPLAT 



<220> 
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<221> CDS 

<222> (1026) . . (1898) 

<223> Physcomitrella patens delta-6-elongase 



<400> 34 

tggggaaccc tgtggttggc atgcacatac aaatggacga aggataaacc ttttcacgcc 60 

cttttaaata tccgattatt ctaataaacg ctcttttctc ttaggtttac ccgccaatat 120 

atcctgtcaa acactgatag tttaaactga aggcgggaaa cgacaatctg atcatgagcg 180 

gagaattaag ggagtcacgt tatgaccccc gccgatgacg cgggacaagc cgttttacgt 240 

ttggaactga cagaaccgca acgttgaagg agccactcag ccgatctgaa ttcatcgatc 3 00 

ctctagaggc gcgccgagct cctcgagcaa atttacacat tgccactaaa cgtctaaacc 360 

cttgtaattt gtttttgttt tactatgtgt gttatgtatt tgatttgcga taaattttta 420 

tatttggtac taaatttata acacctttta tgctaacgtt tgccaacact tagcaatttg 480 

. caagttgatt aattgattct aaattatttt tgtcttctaa atacatatac taatcaactg 540 

gaaatgtaaa tatttgctaa tatttctact ataggagaat taaagtgagt gaatatggta 600 

ccacaaggtt tggagattta attgttgcaa tgctgcatgg' atggcatata caccaaacat 660 

tcaataattc ttgaggataa taatggtacc acacaagatt tgaggtgcat gaacgtcacg 720 

tggacaaaag gtttagtaat ttttcaagac aacaatgtta ccacacacaa gttttgaggt 780 

gcatgcatgg atgccctgtg gaaagtttaa aaatattttg gaaatgattt gcatggaagc 840 

catgtgtaaa accatgacat ccacttggag gatgcaataa tgaagaaaac tacaaattta 900 

catgcaacta gttatgcatg tagtctatat aatgaggatt ttgcaatact ttcattcata 960 

cacactcact aagttttaca cgattataat ttcttcatag ccagcccacc gcggtgggcg 1020 

gccgc atg gag gtc gtg gag aga ttc tac ggt gag ttg gat ggg aag gtc 1070 
Met Glu Val Val Glu Arg Phe Tyr Gly Glu Leu Asp Gly Lys Val 
1 5 10 15 

teg cag ggc gtg aat gca ttg ctg ggt agt ttt ggg gtg gag ttg acg 1118 
Ser Gin Gly Val Asn Ala Leu Leu Gly Ser Phe Gly Val Glu Leu Thr 
20 25 30 

gat acg ccc act acc aaa ggc ttg ccc etc gtt gac agt ccc aca ccc 1166 
Asp Thr Pro Thr Thr Lys Gly Leu Pro Leu Val Asp Ser Pro Thr Pro 
35 40 45 

ate gtc etc ggt gtt tct gta tac ttg act att gtc att gga ggg ctt 1214 
He Val Leu Gly Val Ser Val Tyr Leu Thr He Val He Gly Gly Leu 
50 55 60 



ttg tgg ata aag gee agg gat ctg aaa ccg cgc gee teg gag cca ttt 
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Leu Trp lie Lys Ala Arg Asp Leu Lys Pro Arg Ala Ser Glu Pro Phe 
65 70 75 

ttg etc caa get ttg gtg ctt gtg cac aac ctg ttc tgt ttt gcg etc 1310 
Leu Leu Gin Ala Leu Val Leu Val His Asn Leu Phe Cys Phe Ala Leu 
80 85 90 95 

agt ctg tat atg tgc gtg ggc ate get tat cag get att ace tgg egg 135 8 

Ser Leu Tyr Met Cys Val Gly lie Ala Tyr Gin Ala lie Thr Trp Arg 
100 105 110 

tac tct etc tgg ggc aat gca tac aat cct aaa cat aaa gag atg gcg 1406 
Tyr Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys His Lys Glu Met Ala 
115 120 125 

att ctg gta tac ttg ttc tac atg tct aag tac gtg gaa ttc atg gat 1454 
lie Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr Val Glu Phe Met Asp 
130 135 . 140 

ace gtt ate atg ata ctg aag cgc age acc agg caa ata age ttc etc 1502 
Thr Val lie Met lie Leu Lys Arg Ser Thr Arg Gin lie Ser Phe Leu 
145 150 155 

cac gtt tat cat cat tct tea att tec etc att tgg tgg get att get 15.50 
His Val Tyr His His Ser Ser lie Ser Leu lie Trp Trp Ala lie Ala 
160 165 170 175 

cat cac get cct ggc ggt gaa gca tat tgg tct gcg get ctg aac tea 1598 
His His Ala Pro Gly Gly Glu Ala Tyr Trp Ser Ala Ala Leu Asn Ser 
180 185 190 

gga gtg cat gtt etc atg tat gcg tat tac ttc ttg get gee tgc ctt 1646 
Gly Val His Val Leu Met Tyr Ala Tyr Tyr Phe Leu Ala Ala Cys Leu 
195 200 205 

cga agt age cca aag tta aaa aat aag tac ctt ttt tgg ggc agg tac 1694 
Arg Ser Ser Pro Lys Leu Lys Asn Lys Tyr Leu Phe Trp Gly Arg Tyr 
210 215 220 

ttg aca caa ttc caa atg ttc cag ttt atg ctg aac tta gtg cag get 1742 
Leu Thr Gin Phe Gin Met Phe Gin Phe Met Leu Asn Leu Val Gin Ala 
225 230 . 235 

tac tac gac atg aaa acg aat gcg cca tat cca caa tgg ctg ate aag 1790 
Tyr Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro Gin Trp Leu lie Lys 
240 245 250 255 

att ttg ttc tac tac atg ate teg ttg ctg ttt ctt ttc ggc aat ttt 1838 
lie Leu Phe Tyr Tyr Met lie Ser Leu Leu Phe Leu Phe Gly Asn Phe 
260 265 270 

tac gta caa aaa tac ate aaa ccc tct gac gga aag caa aag gga get 1886 
Tyr Val Gin Lys Tyr lie Lys Pro Ser Asp Gly Lys Gin Lys Gly Ala 
275 280 285 

aaa act gag tga tctagaaggc ctcctgcttt aatgagatat gcgagacgcc 1938 
Lys Thr Glu 
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tatgatcgca tgatatttgc tttcaattct gttgtgcacg ttgt.aaaaaa cctgagcatg 1998 

tgtagctcag atccttaccg ccggtttcgg ttcattctaa tgaatatatc acccgttact 2058 

atcgtatttt tatgaataat attctccgtt caatttactg attgtccgtc gagcaaattt 2118 

acacattgcc actaaacgtc taaacccttg taatttgttt ttgttttact atgtgtgtta 2178 

tgtatttgat ttgcgataaa tttttatatt tggtactaaa tttataacac cttttatgct 2238 

aacgtttgcc aacacttagc aatttgcaag ttgattaatt gattctaaat tatttttgtc 2298 

ttctaaatac atatactaat caactggaaa tgtaaatatt tgctaatatt tctactatag 2358 

gagaattaaa gtgagtgaat . atggtaccac aaggtttgga gatttaattg ttgcaatgct 2418 

gcatggatgg catatacacc aaacattcaa taattcttga ggataataat ggtaccacac 2478 

aagatttgag gtgcatgaac gtcacgtgga caaaaggttt agtaattttt caagacaaca 2538 

atgttaccac acacaagttt tgaggtgcat gcatggatgc cctgtggaaa gtttaaaaat 2598 

attttggaaa tgatttgcat ggaagccatg tgtaaaacca tgacatccac ttggaggatg 2658 

caataatgaa gaaaactaca aatttacatg caactagtta 1 tgcatgtagt ctatataatg 2718 

aggattttgc aatactttca ttcatacaca ctcactaagt tttacacgat tataatttct 2778 

tcatagccag cggatccgcc cacata atg gag aac ttc tgg tct att gtt gtg 2831 

Met Glu Asn Phe Trp Ser lie Val Val 
295 

ttt ttt eta etc tea att etc ttc att tta tat aac ata teg aca gta 2879 
Phe Phe Leu Leu Ser lie Leu Phe lie Leu Tyr Asn lie Ser Thr Val 
300. 305 310 315 

tgc cac tac tat atg egg att teg ttt tat tac ttc aca att tta ttg 2927 
Cys His Tyr Tyr Met Arg lie Ser Phe Tyr Tyr Phe Thr lie Leu Leu 
320 325 330 

cat gga atg gaa gtt tgt gtt aca atg ate cct tct tgg eta aat ggg 2975 
His Gly Met Glu Val Cys Val Thr Met lie Pro Ser Trp Leu Asn Gly 
335 340 345 

aag ggt get gat tac gtg ttt cac teg ttt ttc tat tgg tgt aaa tgg 3023 
Lys Gly Ala Asp Tyr Val Phe His Ser Phe Phe Tyr Trp Cys Lys Trp 
350 355 360 

act ggt gtt cat aca aca gtc tat gga tat gaa aaa aca caa gtt gaa 3071 
Thr Gly Val His Thr Thr Val Tyr Gly Tyr Giu Lys Thr Gin Val Glu 
365 370 375 

ggt ccg get gta gtt att tgt aat cat cag agt tct etc gac att eta 3119 
Gly Pro Ala Val Val lie Cys Asn His Gin Ser Ser Leu Asp lie Leu 
380 385 390 395 
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teg atg gca tea , ate tgg ccg aag aat tgt gtt gta atg atg aaa cga 3167 
Ser Met Ala Ser lie Trp Pro Lys Asn Cys Val Val Met Met Lys Arg 
400 405 410 

att ctt gec tat gtt cca ttc ttc aat etc gga gee tac ttt tec aac 3215 
lie Leu Ala Tyr Val Pro Phe Phe Asn Leu Gly Ala Tyr Phe Ser Asn 
415 420 425 

aca ate ttc ate gat cga tat aac cgt gaa cgt gcg atg get tea gtt 32 63 

Thr lie Phe lie Asp Arg Tyr Asn Arg Glu Arg Ala Met Ala Ser Val 
430 435 440 

gat tat tgt gca tct gaa atg aag aac aga aat ctt aaa ctt tgg gta 3311 
Asp Tyr Cys Ala Ser Glu Met Lys Asn Arg Asn Leu Lys Leu Trp Val 
445 450 455 

ttt ccg gaa gga aca aga aat cgt gaa. gga ggg ttc att cca ttc aag 3359 
Phe Pro Glu Gly Thr Arg Asn Arg Glu Gly Gly Phe lie Pro Phe Lys 
460 465 470 475 

aaa gga gca ttc aat att gca gtt cgt gcg cag att ccc att att cca 3407 
Lys Gly Ala Phe Asn He Ala Val Arg Ala Gin He Pro He He Pro 
480 485 490 

gtt gta ttc tea gac tat egg gat ttc tac tea aag cca ggc cga tat 3455 
Val Val Phe Ser Asp Tyr Arg Asp Phe Tyr Ser Lys Pro Gly Arg Tyr 
495 500 505 

ttc aag aat gat gga gaa gtt gtt att cga gtt ctg gat gcg att cca 3503 
Phe Lys Asn Asp Gly Glu Val Val He Arg Val Leu Asp Ala He Pro 
510 515 520 

aca aaa ggg etc act ctt gat gac gtc age gag ttg tct gat atg tgt 3551 
Thr Lys Gly Leu Thr Leu Asp Asp Val Ser Glu Leu Ser Asp Met Cys 
525 530 535 

egg gac gtt atg ttg gca gee tat aag gaa gtt act eta gaa get cag 3599 
Arg Asp Val Met Leu Ala Ala Tyr Lys Glu Val Thr Leu Glu Ala Gin 
540 545 550 555 

caa cga aat gcg aca egg cgt gga gaa aca aaa gac ggg aag aaa tct 3647 
Gin Arg Asn Ala Thr Arg Arg Gly Glu Thr Lys Asp Gly Lys Lys Ser 
560 565 570 

gag taa getagegtta accctgcttt aatgagatat gcgagacgcc tatgatcgea 3703 
Glu 



tgatatttgc tttcaattct gttgtgcacg ttgtaaaaaa cctgagcatg tgtagctcag 3763 

atccttaccg ccggtttcgg ttcattctaa tgaatatatc acccgttact ategtatttt 3823 

tatgaataat attctcegtt caatttactg attgtccgtc gagcaaattt acacattgcc 3883 

actaaaegtc taaacccttg taatttgttt ttgttttact atgtgtgtta tgtatttgat 3943 
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ttgcgataaa tttttatatt tggtactaaa tttataacac cttttatgct aacgtttgcc 4003 

aacacttagc aatttgcaag ttgattaatt gattctaaat tatt.tttgtc ttctaaatac 4063 

atatactaat caactggaaa tgtaaatatt tgctaatatt tctactatag gagaattaaa 4123 

gtgagtgaat atggtaccac aaggtttgga gatttaattg ttgcaatgct gcatggatgg 4183 

catatacacc aaacattcaa : taattcttga ggataataat ggtaccacac aagatttgag 4243 

gtgcatgaac gtcacgtgga caaaaggttt agtaattttt caagacaaca atgttaccac 4303 

acacaagttt tgaggtgcat gcatggatgc cctgtggaaa gtttaaaaat attttggaaa 43 63 

tgatttgcat ggaagccatg tgtaaaacca tgacatccac ttggaggatg caataatgaa 4423 

gaaaactaca aatttacatg. caactagtta tgcatgtagt ctatataatg aggattttgc 4483 

aatactttca ttcatacaca ctcactaagt tttacacgat tataatttct tcatagccag 4543 

cagatctaaa atg ggc aaa gga ggg gac get egg gee teg aag ggc tea 4592 
Met Gly Lys Gly Gly Asp Ala Arg Ala Ser Lys Gly Ser 
575 580 585 

acg gcg get cgc aag ate agt tgg cag gaa gtc aag ace cac gcg tct 4640 
Thr Ala Ala Arg Lys He Ser Trp Gin Glu Val Lys Thr His Ala Ser 
590 595 • 600 

ccg gag gac gee tgg ate att cac tec aat aag gtc tac gac gtg tec 4688 
Pro Glu Asp Ala Trp He He His Ser Asn Lys Val Tyr Asp Val Ser 
605 610 615 

aac tgg cac gaa cat ccc gga ggc gee gtc att ttc acg cac gee ggt 473 6 

Asn Trp His Glu His Pro Gly Gly Ala Val lie Phe Thr His Ala Gly 
620 625 630 

gac gac atg acg gac att ttc get gee ttt cac gca ccc gga teg cag 4784 
Asp Asp Met Thr Asp He Phe Ala Ala Phe His Ala Pro Gly Ser Gin 
635 640 645 

teg etc atg aag aag ttc tac. att ggc gaa ttg etc ccg gaa ace acc 4832 
Ser Leu Met Lys Lys Phe Tyr He Gly Glu Leu Leu Pro Glu Thr Thr 
650 655 660 665 

ggc aag gag ccg cag caa ate gee ttt gaa aag ggc tac cgc gat ctg 4880 
Gly Lys Glu Pro Gin Gin He Ala Phe Glu Lys Gly Tyr Arg Asp Leu 
670 675 680 

cgc tec aaa etc ate atg atg ggc atg ttc aag tec aac aag tgg ttc 4928 
Arg Ser Lys Leu He Met Met Gly Met Phe Lys Ser Asn Lys Trp Phe 
685 690 695 

tac gtc tac aag tgc etc age aac atg gee att tgg gee gee gee tgt 4976 
Tyr Val Tyr Lys Cys Leu Ser Asn Met Ala He Trp Ala Ala Ala Cys 
700 705 710 

get etc gtc ttt tac teg gac cgc ttc tgg gta cac ctg gee age gee 5024 
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Ala Leu Val Phe Tyr Ser Asp Arg Phe Trp Val His Leu Ala Ser Ala 
715 720 725 

gtc atg ctg gga aca ttc ttt cag cag teg gga tgg ttg gca cac gac 5072 
Val Met Leu Gly Thr Phe Phe Gin Gin Ser Gly Trp Leu Ala His Asp 
730 735 740 745 

ttt ctg cac cac cag gtc ttc acc aag cgc aag cac ggg gat etc gga 5120 
Phe Leu His His Gin Val Phe Thr Lys Arg Lys His Gly Asp Leu Gly 
750 755 760 

gga etc ttt tgg ggg aac etc atg cag ggt tac tec gta cag tgg tgg 5168 
Gly Leu Phe Trp Gly Asn Leu Met Gin Gly Tyr Ser Val Gin Trp Trp 
765 770 775 

aaa aac aag cac aac gga cac cac gec gtc ccc aac etc cac tgc tec 5216 
Lys Asn Lys His Asn Gly His His Ala Val Pro Asn Leu His Cys Ser 
780 785 ' 790 

tec gca gtc gcg caa gat ggg gac ccg gac ate gat acc atg ccc ctt 5264 
Ser Ala Val Ala Gin Asp Gly Asp Pro Asp lie Asp Thr Met Pro Leu 
795 800 805 

etc gee tgg tec gtc cag caa gee cag tct tac egg gaa etc caa gee 5312 
Leu Ala Trp Ser Val Gin Gin Ala Gin Ser Tyr Arg Glu Leu Gin Ala 
810 815 820 825 

gac gga aag gat teg ggt ttg gtc aag ttc atg ate cgt aac caa tec 53 60 

Asp Gly Lys Asp Ser Gly Leu Val Lys Phe Met lie Arg Asn Gin Ser 
830 835 840 

tac ttt tac ttt ccc ate ttg ttg etc gee cgc ctg teg tgg ttg aac 5408 
Tyr Phe Tyr Phe Pro lie Leu Leu Leu Ala Arg Leu Ser Trp Leu Asn 
845 850 855 

gag tec ttc aag tgc gee ttt ggg ctt gga get gcg teg gag aac get 5456 
Glu Ser Phe Lys Cys Ala Phe Gly Leu Gly Ala Ala Ser Glu Asn Ala 
860 865 870 

get etc gaa etc aag gee aag ggt ctt cag tac ccc ctt ttg gaa aag 5504 
Ala Leu Glu Leu Lys Ala Lys Gly Leu Gin Tyr Pro Leu Leu Glu Lys 
875 880 885 

get ggc ate ctg ctg cac tac get tgg atg ctt aca gtt teg tec ggc 5552 
Ala Gly lie Leu Leu His Tyr Ala Trp Met Leu Thr Val Ser Ser Gly 
890 895 900 905 

ttt gga cgc ttc teg ttc gcg tac acc gca ttt tac ttt eta acc gcg 5600 
Phe Gly Arg Phe Ser Phe Ala Tyr Thr Ala Phe Tyr Phe Leu Thr Ala 
910 915 920 

acc gcg tec tgt gga ttc ttg etc gee att gtc ttc ggc etc ggc cac 5648 
Thr Ala Ser Cys Gly Phe Leu Leu Ala lie Val Phe Gly Leu Gly His 
925 930 935 

aac ggc atg gee acc tac aat gee gac gee cgt ccg gac ttc tgg aag 5696 
Asn Gly Met Ala Thr Tyr Asn Ala Asp Ala Arg Pro Asp Phe Trp Lys 
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940 945 950 

etc caa gtc acc acg act cgc aac gtc acg ggc gga cac ggt ttc ccc 5744 
Leu Gin Val Thr Thr Thr Arg Asn Val Thr Gly Gly His Gly Phe Pro 
955 960 965 

caa gec ttt gtc.gac tgg ttc tgt ggt ggc etc cag tac caa gtc gac 5792 
Gin Ala Phe Val Asp Trp Phe Cys Gly Gly Leu Gin Tyr Gin Val Asp 
970 .. 975 980 985 

cac cac tta ttc ccc age ctg ccc cga cac aat ctg gec aag aca cac 5840 
His His Leu Phe . Pro Ser Leu Pro Arg His Asn Leu Ala Lys Thr His 
990 995 1000 

gca ctg gtc gaa teg ttc tgc aag gag tgg ggt gtc cag tac cac 5885 
Ala Leu Val Glu Ser Phe Cys Lys Glu Trp Gly Val Gin Tyr His 
1005 1010 1015 

gaa gee gac ctt gtg gac ggg acc atg gaa gtc ttg cac cat ttg 5930 
Glu Ala Asp Leu Val Asp Gly Thr Met Glu Val Leu His His Leu 
1020 1025 1030 

ggc age gtg gec ggc gaa ttc gtc gtg gat ttt gta cgc gat gga 5975 
Gly Ser Val Ala Gly Glu Phe Val Val Asp Phe Val Arg Asp Gly 
1035 1040 1045 

ccc gec atg taa agatctgecg gcatcgatcc egggecatgg ectgetttaa 6027 
Pro Ala Met 



tgagatatgc gagacgecta tgatcgcatg atatttgett tcaattctgt tgtgcacgtt 6087 

gtaaaaaacc tgagcatgtg tagctcaga.t ccttaccgcc ggtttcggtt cattctaatg 6147 

aatatatcac ccgttactat cgtattttta tgaataatat tctccgttca atttactgat 6207 

tgtcegtega cgagctcggc gcgccgtcga cctgcaggca tgeaagctte acgctgccgc 6267 

aagcactcag ggcgcaaggg ctgetaaagg aageggaaca egtagaaage cagtccgcag 6327 

aaacggtgct gaccccggat gaatgtcagc tactgggcta tctggacaag ggaaaacgea 6387 

agegcaaaga gaaagcaggt agettgeagt gggcttacat ggegataget agactgggcg 6447 

gttttatgga cagcaagega aceggaattg ccagctgggg cgccctctgg taaggttggg 6507 

aagccctgca aagtaaactg gatggctttc ttgccgccaa ggatctgatg gcgcagggga 6567 

tcaagatcat gageggagaa ttaagggagt cacgttatga cccccgccga tgacgcggga 6627 

caagccgttt tacgtttgga actgacagaa ccgcaacgtt gaaggageca ctcagccgcg 6687 

ggtttctgga gtttaatgag ctaagcacat aegtcagaaa ecattattge gcgttcaaaa 6747 

gtcgcctaag gtcactatca gctagcaaat atttcttgtc aaaaatgetc cactgaegtt 6807 

ccataaattc ccctcggtat ccaattagag tctcatattc actctcaatc cagatctcga 6867 
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ctctagtcga gggcccatgg gagcttggat tgaacaagat ggattgcacg caggttctcc 6927 

ggccgcttgg gtggagaggc tattcggcta tgactgggca caacagacaa tcggctgctc 6987 

tgatgccgcc gtgttccggc tgtcagcgca ggggcgcccg gttctttttg tcaagaccga 7047 

cctgtccggt gccctgaatg aactgcagga cgaggcagcg cggctatcgt ggctggccac 7107 

gacgggcgtt ccttgcgcag ctgtgctcga cgttgtcact gaagcgggaa gggactggct 7167 

gctattgggc gaagtgccgg ggcaggatct cctgtcatct caccttgctc ctgccgagaa 7227 

agtatccatc atggctgatg caatgcggcg gctgcatacg cttgatccgg ctacctgccc 7287 

attcgaccac caagcgaaac atcgcatcga gcgagcacgt actcggatgg aagccggtct 7347 

tgtcgatcag gatgatctgg acgaagagca tcaggggctc gcgccagccg aactgttcgc 7407 

caggctcaag gcgcgcatgc ccgacggcga ggatctcgtc gtgacccatg gcgatgcctg 7467 

cttgccgaat atcatggtgg aaaatggccg cttttctgga ttcatcgact gtggccggct 7527 

gggtgtggcg gaccgctatc aggacatagc gttggctacc cgtgatattg ctgaagagct 75.87 

tggcggcgaa tgggctgacc gcttcctcgt gctttacggt atcgccgctc ccgattcgca 7647 

gcgcatcgcc ttctatcgcc ttcttgacga gttcttctga gcgggaccca agctagcttc 7707 

gacggatccc ccgatgagct aagctagcta tatcatcaat ttatgtatta cacataatat 7767 

cgcactcagt ctttcatcta cggcaatgta ccagctgata taatcagtta ttgaaatatt 7827 

tctgaatt.ta aacttgcatc aataaattta tgtttttgct tggactataa tacctgactt 7887 

gttattttat caataaatat ttaaactata tttct.ttcaa gatgggaatt aattcactgg 7947 

ccgtcgtttt acaacgtcgt gactgggaaa acectggcgt tacccaactt aatcgccttg 8007 

cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc gatcgccctt 8067 

cccaacagtt gcgcagcctg aatggcgccc gctcctttcg ctttcttccc ttcctttctc 8127 

gccacgttcg ccggctttcc ccgtcaagct ctaaatcggg ggctcccttt agggttccga 8187 

tttagtgctt tacggcacct cgaccccaaa aaacttgatt tgggtgatgg ttcacgtagt 8247 

gggccatcgc cctgatagac ggtttttcgc cctttgacgt tggagtccac gttctttaat 8307 

agtggactct tgttccaaac tggaacaaca ctcaacccta tctcgggcta ttcttttgat 8367 

ttataaggga ttttgccgat ttcggaacca ccatcaaaca ggattttcgc ctgctggggc 8427 

aaaccagcgt ggaccgcttg ctgcaactct ctcagggcca ggcggtgaag ggcaatcagc 8487 

tgttgcccgt ctcactggtg aaaagaaaaa ccaccccagt acattaaaaa cgtccgcaat 8547 
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gtgttattaa gttgtctaag cgtcaatttg tttacaccac aatatatcct gccaccagcc 8607 

agccaacagc tccccgaccg gcagctcggc acaaaatcac cactegatac aggcagccca 8667 

tcagtccggg acggcgtcag cgggagagcc gttgtaaggc ggcagacttt gctcatg.tta 8727 

ccgatgctat tcggaagaac ggcaactaag ctgccgggtt tgaaacacgg atgatctcgc 87 87 

ggagggtagc atgttgattg taacgatgac agagcgttgc tgcctgtgat caaatatcat 8847 

ctccctcgca gagatccgaa ttatcagcct tcttattcat ttctcgctta accgtgacag 8907 

gctgtcgatc ttgagaacta tgccgacata ataggaaatc gctggataaa gccgctgagg 8967 

aagctgagtg gcgctatttc tttagaagtg aacgttgacg atatcaactc ccctatccat 9027 

tgctcaccga atggtacagg tcggggaccc gaagttccga ctgtcggcct gatgcatccc 9087 

cggctgatcg accccagatc tggggctgag aaagcccagt aaggaaacaa ctgtaggttc 9147 

gagtcgcgag atcccccgga accaaaggaa gtaggttaaa cccgctccga tcaggccgag 9207 

ccacgccagg ccgagaacat tggttcctgt aggcatcggg attggcggat caaacactaa 9267 

agctactgga acgagcagaa gtcctccggc cgccagttgc caggcggtaa aggtgagcag 9327 

aggcacggga ggttgccact tgcgggtcag cacggttccg 1 aacgccatgg aaaccgcccc 9387 

cgccaggccc gctgcgacgc cgacaggatc tagcgctgcg tttggtgtca acaccaacag 9447 

cgccacgccc gcagttccgc aaatagcccc caggaccgcc atcaatcgta tcgggctacc 9507 

tagcagagcg gcagagatga acacgaccat cagcggctgc acagcgccta ccgtcgccgc 9567 

gaccccgccc ggcaggcggt agaccgaaat aaacaacaag ctccagaata gcgaaatatt 9627 

aagtgcgccg aggatgaaga tgcgcatcca ccagattccc gttggaatct gtcggacgat 9687 

catcacgagc aataaacccg ccggcaacgc ccgcagcagc ataccggcga cccctcggcc 9747 

tcgctgttcg- ggctccacga aaacgccgga cagatgcgcc ttgtgagcgt ccttggggcc 9807 

gtcctcctgt ttgaagaccg acagcccaat gatctcgccg tcgatgtagg cgccgaatgc 9867 

cacggcatct cgcaaccgtt cagcgaacgc ctccatgggc tttttctcct cgtgctcgta 9927 
aacggacccg aacatctctg gagctttctt cagggccgac aatcggatct cgcggaaatc 9987 

ctgcacgtcg gccgctccaa gccgtcgaat ctgagcctta atcacaattg tcaattttaa 10047 

tcctctgttt atcggcagtt cgtagagcgc gccgtgcgtc ccgagcgata ctgagcgaag 10107 

caagtgcgtc gagcagtgcc cgcttgttcc tgaaatgcca gtaaagcgct ggctgctgaa 10167 

cccccagccg gaactgaccc cacaaggccc tagcgtttgc aatgcaccag gtcatcattg 10227 

acccaggcgt gttccaccag gccgctgcct cgcaactctt cgcaggcttc gccgacctgc 10287 
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cctgtaggcg 


cgacaggcat 


tcctcggcct 


ccg cgcaaua 


CCyyCCaccy 


1 0647 


atcgaccagc 


ccaggtcctg 


gcaaagctcg 


tagaacgtga 


aggtgatcgg 


/-i ^ jr*Y /T 4™ O 

c ccgccgaua 


1 07 07 


ggggtgcgct 


tcgcgtactc 


caacacctgc 


tgccacacca 


gt tcgtcatc 


gucggcccgc 


107 67 
X U / 0 / 


agctcgacgc 


cggtgtaggt 


gatcttcacg 


tccttgttga 


cgtggaaaat 


gaccc uyt u c 


1 0ft97 
1 U oZ / 


tgcagcgcct 


cgcgcgggat 


tttcttgttg 


cgcgtggtga 


acagggcaga 


gcgggccgtg 


1 H Q Q7 


tcgtttggca 


tcgctcgcat 


cgtgtccggc 


cacggcgcaa 


tatcgaacaa 


ggaaagctgc 


1 O Q / 7 


atttccttga 


tctgctgctt 


cgtgtgtttc 


agcaacgcgg 


cctgcttggc 


ctcgctgacc 


11U.U / 


tgttttgcca 


ggtcctcgcc 


ggcggttttt 


cgcttcttgg 


tcgtcatagt 


tcctcgcgtg 


1 1 n £ 7 

HUD / 


tcgatggtca tcgacttcgc 


caaacctgcc 


gcctcctgtt 


cgagacgacg 


cgaacgctcc 


111 07 


acggcggccg 


atggcgcggg 


cagggcaggg 


ggagccagtt 


gcacgctgtc 


gcgctcgatc 


111 Q7 
11 1 O / 


ttggccgtag 


cttgctggac 


catcgagccg 


acggactgga 


aggtttcgcg 


gggcgcacgc 


1 1 9 A7 
1 1Z 4 / 


atgacggtgc 


ggcttgcgat 


ggtttcggca 


tcctcggcgg 


aaaaccccgc 


gtcgatcagt 


1 1 "i 07 
11.3 U / 


tcttgcctgt 


atgccttccg 


gtcaaacgtc 


cgattcattc 


accctccttg 


egggattgee 


11^67 


ccgactcacg 


ccggggcaat 


gtgcccttat 


tcctgatttg 


acccgcctgg 


tgee ttggtg 


11/19 7 
114Z / 


tccagataat 


ccaccttatc 


ggcaatgaag 


tcggtcccgt 


agaccgtctg 


gccgtccttc 


1 1 /I Q7 
1144 o / 


tcgtacttgg 


tattccgaat 


cttgccctgc 


acgaatacca 


gcgacccct t 


gcccaaatac 


1 1 RA7 


ttgccgtggg 


cctcggcctg 


agagccaaaa 


cacttgatgc 


ggaagaagtc 


ggtgcgctcc 


I 1 607 

I I D U / 


tgcttgtcgc 


cggcatcgtt 


gcgccacatc 


taggtactaa 


aacaat teat 


ccagcaaaai. 


I 1 667 

I I D O / 


ataatatttt 


attttctccc 


aatcaggctt 


gatccccagt 


aagtcaaaaa 


at age tcgac 


I 1 797 

II / z / 


atactgttct 


tccccgatat 


cctccctgat 


cgaccggacg 


cagaaggcaa 


tgtcatacca 


11787 


cttgtccgcc 


ctgccgcttc 


tcccaagatc 


aataaagcca 


ettactttge 


catctttcac 


11847 


aaagatgttg 


ctgtctccca 


ggtcgccgtg 


ggaaaagaca 


agttcctctt 


egggctttte 


11907 


cgtctttaaa 


aaatcataca 


gctcgcgcgg 


atctttaaat 


ggagtgtctt 


cttcccagtt 


11967 
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ttcgcaatcc acatcggcca gatcgttatt cagtaagtaa tccaattcgg ctaagcggct 12027 

gtctaagcta. ttcgtatagg gacaatccga tatgtcgatg gagt.gaaaga gcctgatgca 12087 

ctccgcatac agctcgataa tcttttcagg gctttgttca tcttcatact cttccgagca 12147 

aaggacgcca tcggcctcac tcatgagcag attgctccag ccatcatgcc gttcaaagtg 122 07 

caggaccttt ggaacaggca gctttccttc cagccatagc atcatgtcct tttcccgttc 122 67 

cacatcatag gtggtccctt tataccggct gtccgtcatt tttaaatata ggttttcatt 12327 

ttctcccacc agcttatata ccttagcagg agacattcct tccgtatctt ttacgcagcg 12387 

gtatttttcg atcagttttt tcaattccgg- tgatattctc attttagcca tttattattt 12447 

ccttcctctt ttctacagta tttaaagata ccccaagaag ctaattataa caagacgaac 12507 

tccaattcac tgttccttgc attctaaaac cttaaatacc agaaaacagc tttttcaaag 12567 

ttgttttcaa agttggcgta taacatagta tcgacggagc cgattttgaa accacaatta 12627 

tgggtgatgc tgccaactta ctgatttagt gtatgatggt gtttttgagg tgctccagtg 12687 

gcttctgtgt ctatcagctg tccctcctgt tcagctactg acggggtggt gcgtaacggc 12747 

aaaagcaccg ccggacatca gcgctatctc tgctctcact- gccgtaaaac atggcaactg 12807 

cagttcactt acaccgcttc tcaacccggt acgcaccaga aaatcattga tatggccatg 12867 

aatggcgttg gatgccgggc aacagcccgc attatgggcg ttggcctcaa cacgatttta 12927 

cgtcacttaa aaaactcagg ccgcagtcgg taacctcgcg catacagccg ggcagtgacg 12987 

tcatcgtctg cgcggaaatg gacgaacagt ggggctatgt cggggctaaa tcgcgccagc 13047 

gctggctgtt ttacgcgtat gacagtctcc ggaagacggt tgttgcgcac gtattcggtg 13107 

aacgcactat ggcgacgctg gggcgtctta tgagcctgct gtcacccttt gacgtggtga 13167 

tatggatgac ggatggctgg ccgctgtatg aatcccgcct gaagggaaag ctgcacgtaa 13227 

tcagcaagcg atatacgcag cgaattgagc ggcataacct gaatctgagg cagcacctgg 13287 

cacggctggg acggaagtcg ctgtcgttct caaaatcggt ggagctgcat gacaaagtca 13347 

tcgggcatta tctgaacata aaacactatc aataagttgg agtcattacc caattatgat 13407 

agaatttaca agctataagg ttattgtcct gggtttcaag cattagtcca tgcaagtttt 13467 

tatgctttgc ccattctata gatatattga taagcgcgct gcctatgcct tgccccctga 13527 

aatccttaca tacggcgata tcttctatat aaaagatata ttatcttatc agtattgtca 13587 

atatattcaa ggcaatctgc ctcctcatcc tcttcatcct cttcgtcttg gtagcttttt 13647 

aaatatggcg cttcatagag taattctgta aaggtccaat tctcgttttc atacctcggt 13707 
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ataatcttac ctatcacctc aaatggttcg ctgggtttat cgcacccccg aacacgagca 13767 

cggcacccgc gaccactatg ccaagaatgc ccaaggtaaa aattgccggc cccgccatga 13827 

agtccgtgaa tgccccgacg gccgaagtga agggcaggcc gccacccagg ccgccgccct 13887 

cactgcccgg cacctggtcg ctgaatgtcg atgccagcac ctgc.ggcacg tcaatgcttc 13947 

cgggcgtcgc gctcgggctg atcgcccatc ccgttactgc cccgatcccg gcaatggcaa 14007 

ggactgccag cgctgccatt tttggggtga ggccgttcgc ggccgagggg cgcagcccct 14067 

ggggggatgg gaggcccgcg ttagcgggcc gggagggttc gagaaggggg ggcacccccc 14127 

ttcggcgtgc gcggtcacgc gcacagggcg cagccctggt taaaaacaag gtttataaat 14187 

attggtttaa aagcaggtta aaagacaggt tagcggtggc cgaaaaacgg gcggaaaccc 14247 

ttgcaaatgc tggattttct gcctgtggac agcccctcaa atgtcaatag gtgcgcccct 14307 

catctgtcag cactctgccc ctcaagtgtc aaggatcgcg cccctcatct gtcagtagtc 14367 

gcgcccctca agtgtcaata ccgcagggca cttatcccca ggcttgtcca catcatctgt 14427 

gggaaactcg cgtaaaatca ggcgttttcg ccgatttgcg aggctggcca gctccacgtc 14487 

gccggccgaa atcgagcctg cccctcatct gtcaacgccg cgccgggtga gtcggcccct 14547 

caagtgtcaa cgtccgcccc tcatctgtca gtgagggcca agttttccgc gaggtatcca 14607 

caacgccggc ggccgcggtg tctcgcacac ggcttcgacg gcgtttctgg cgcgtttgca 14667 

gggccataga cggccgccag cccagcggcg agggcaacca gcccggtgag cgtcgcaaag 14727 

gcgctcggtc ttgccttgct cgtcggtgat gtacttcacc agctccgcga agtcgctctt 14787 

cttgatggag cgcatgggga cgtgcttggc aatcacgcgc accccccggc cgttttagcg 14847 

gctaaaaaag tcatggctct gccctcgggc ggaccacgcc catcatgacc ttgccaagct 14907 

cgtcctgctt ctcttcgatc ttcgccagca gggcgaggat cgtggcatca ccgaaccgcg 14967 

ccgtgcgcgg gtcgtcggtg agccagagtt tcagcaggcc gcccaggcgg cccaggtcgc 15027 

cattgatgcg ggccagctcg cggacgtgct catagtccac gacgcccgtg attttgtagc 15087 

cctggccgac ggccagcagg taggccgaca ggctcatgcc ggccgccgcc gccttttcct 15147 

caatcgctct tcgttcgtct ggaaggcagt acaccttgat aggtgggctg cccttcctgg 15207 

ttggcttggt ttcatcagcc atccgcttgc cctcatctgt nacgccggcg gtagccggcc 15267 

agcctcgcag agcaggattc ccgttgagca ccgccaggtg cgaataaggg acagtgaaga 15327 

aggaacaccc gctcgcgggt gggcctactt cacctatcct gcccggctga cgccgttgga 15387 
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tacaccaagg 


aaagtctaca 


cgaacccttt 


ggcaaaatcc 


tgtatatcgt 


gcgaaaaagg 


15447 


atggatatac 


cgaaaaaatc 


gctataatga 


ccccgaagca 


gggt.tatgca 


gcggaaaagc 


15507 


gccacgcttc 


ccgaagggag 


aaaggcggac 


aggtatccgg 


taagcggcag 


ggtcggaaca 


15567 


ggagagcgca 


cgagggagct 


tccaggggga 


aacgcctggt 


atctttatag 


tcctgtcggg 


15627 


tttcgccacc 


tctgacttga 


gcgtcgattt 


ttgtgatgct 


cgtcaggggg 


gcggagccta 


15687 


tggaaaaacg 


ccagcaacgc 


ggccttttta 


cggttcctgg 


ccttttgctg 


gccttttgct 


15747 


cacatgttct 


ttcctgcgtt 


atcccctgat 


tctgtggata 


accgtattac 


cgcctttgag 


15807 


tgagctgata 


ccgctcgccg 


cagccgaacg 


accgagcgca 


gcgagtcagt 


gagcgaggaa 


15867 


gcggaagagc 


gccagaaggc 


cgccagagag 


gccgagcgcg 


gccgtgaggc 


ttggacgcta 


15927 


gggcagggca 


tgaaaaagcc 


cgtagcgggc 


tgctacgggc 


gtctgacgcg 


gtggaaaggg 


15987 


ggaggggatg 


ttgtctacat 


ggctctgctg 


tagtgagtgg 


gttgcgctcc 


ggcagcggtc 


16047 


ctgatcaatc 


gtcacccttt 


ctcggtcctt 


caacgttcct 


gacaacgagc 


ctccttttcg 


16107 


ccaatccatc 


gacaatcacc 


gcgagtccct 


gctcgaacgc 


tgcgtccgga 


ccggcttcgt 


16167 


cgaaggcgtc 


tatcgcggcc 


cgcaacagcg 


gcgagagcgg* agcctgttca 


acggtgccgc 


16227 


cgcgctcgcc 


ggcatcgctg 


tcgccggcct 


gctcctcaag 


cacggcccca 


acagtgaagt 


16287 


agctgattgt 


catcagcgca 


ttgacggcgt 


ccccggccga 


aaaacccgcc 


tcgcagagga 


16347 


agcgaagctg 


cgcgtcggcc 


gtttccatct 


gcggtgcgcc 


cggtcgcgtg 


ccggcatgga 


16407 


tgcgcgcgcc 


atcgcggtag 


gcgagcagcg 


cctgcctgaa 


gctgcgggca 


ttcccgatca 


16467 


gaaatgagcg 


ccagtcgtcg 


tcggctctcg 


gcaccgaatg 


cgtatgattc 


tccgccagca 


16527 


tggcttcggc 


cagtgcgtcg 


agcagcgccc 


gcttgttcct 


gaagtgccag 


taaagcgccg 


16587 


gctgctgaac 


ccccaaccgt 


tccgccagtt 


tgcgtgtcgt 


cagaccgtct 


acgccgacct 


16647 


cgttcaacag 


gtccagggcg 


gcacggatca 


ctgtattcgg 


ctgcaacttt 


gtcatgcttg 


16707 


acactttatc 


actgataaac 


ataatatgtc 


caccaactta 


tcagtgataa 


agaatccgcg 


16767 


cgttcaatcg 


gaccagcgga 


ggctggtccg 


gaggccagac 


gtgaaaccca 


acatacccct 


16827 


gatcgtaatt 


ctgagcactg 


tcgcgctcga 


cgctgtcggc 


atcggcctga 


ttatgccggt 


16887 


gctgccgggc 


ctcctgcgcg 


atctggttca 


ctcgaacgac 


gtcaccgccc 


actatggcat 


16947 


tctgctggcg 


ctgtatgcgt 


tggtgcaatt 


tgcctgcgca 


cctgtgctgg 


gcgcgctgtc 


17007 


ggatcgtttc 


gggcggcggc 


caatcttgct 


cgtctcgctg 


gccggcgcca 


gate 


17061 
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<210> 35 

<211> 290 

<212> PRT 

<213> Phaeodactylum tricornutum, Physcomitrella patens, Caenorhabditis elegans 

<400> 35 

Met Glu Val Val Glu Arg Phe Tyr Gly Glu Leu Asp Gly Lys Val Ser 
1 5 10 15 



Gin Gly Val Asn Ala Leu Leu Gly Ser Phe Gly Val Glu Leu Thr Asp 
20 25 30 



Thr Pro Thr Thr Lys Gly Leu Pro Leu Val Asp Ser Pro Thr Pro lie 
35 40 45 



Val Leu Gly Val Ser Val Tyr Leu Thr lie Val lie Gly Gly Leu Leu 
50 55 60 



Trp lie Lys Ala Arg Asp Leu Lys Pro Arg Ala Ser Glu Pro Phe Leu 
65 70 75 80 



Leu Gin Ala Leu Val Leu Val His Asn Leu Phe Cys Phe Ala Leu Ser 
85 90 95 



Leu Tyr Met Cys Val Gly lie Ala Tyr Gin Ala lie Thr Trp Arg Tyr 
100 105 110 



Ser Leu Trp Gly Asn Ala Tyr Asn Pro Lys His Lys Glu Met Ala lie 
115 120 125 



Leu Val Tyr Leu Phe Tyr Met Ser Lys Tyr Val Glu Phe Met Asp Thr 
130 " 135 . 140 



Val lie Met lie Leu Lys Arg Ser Thr Arg Gin lie Ser Phe Leu His 
145 150 ■ 155 160 



Val Tyr His His Ser Ser lie Ser Leu lie Trp Trp Ala lie Ala His 
165 170 175 



His Ala Pro Gly Gly Glu Ala Tyr Trp Ser Ala Ala Leu Asn Ser Gly 
180 . 185 190 



Val His Val Leu Met Tyr Ala Tyr Tyr Phe Leu Ala Ala Cys Leu Arg 
195 200 205 
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Ser. Ser Pro Lys Leu Lys Asn Lys Tyr Leu Phe Trp Gly Arg Tyr Leu 
210 215 220 



Thr Gin Phe Gin Met Phe Gin Phe Met Leu Asn Leu Val Gin Ala Tyr 
225 . 230 235 240 



Tyr Asp Met Lys Thr Asn Ala Pro Tyr Pro Gin Trp Leu lie Lys lie 
245 250 255 



Leu Phe Tyr Tyr Met lie Ser Leu Leu Phe Leu Phe Gly Asn Phe Tyr 
260 265 270 



Val Gin Lys Tyr lie Lys Pro Ser Asp Gly Lys Gin Lys Gly Ala Lys 
275 280 285 



Thr Glu 
290 



<210> 36 

<211> 282 

<212> PRT 

<213> Phaeodactylum tricornutum, Physcomitrella patens, Caenorhabditis elegans 

<400> 36 

Met Glu Asn Phe Trp Ser lie Val Val Phe Phe Leu Leu Ser lie Leu 
1 5 10 15 



Phe lie Leu Tyr Asn lie Ser Thr Val Cys His Tyr Tyr Met Arg lie 
20 25 30 



Ser Phe Tyr Tyr Phe Thr lie Leu Leu His Gly Met Glu Val Cys Val 
35 40 45 



Thr Met lie Pro Ser Trp Leu Asn Gly Lys Gly Ala Asp Tyr Val Phe 
50 55 . 60 



His Ser Phe Phe Tyr Trp Cys Lys Trp Thr Gly Val His Thr Thr Val 
65 70 75 80 



Tyr Gly Tyr Glu Lys Thr Gin Val Glu Gly Pro Ala Val Val He Cys 
85 90 95 
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Asn His Gin Ser Ser Leu Asp lie Leu Ser Met Ala Ser lie Trp Pro 
100 105 110 

Lys Asn Cys Val Val Met Met Lys Arg lie Leu Ala Tyr Val Pro Phe 
115 120 125 

Phe Asn Leu Gly Ala Tyr Phe Ser Asn Thr lie Phe lie Asp Arg Tyr 
130 135 140 



Asn Arg Glu Arg Ala Met Ala Ser Val Asp Tyr Cys Ala Ser Glu Met 
145 150 155 160 



Lys Asn Arg Asn Leu Lys Leu Trp Val Phe Pro Glu Gly Thr Arg Asn 
165 170 175 



Arg Glu Gly Gly Phe lie Pro Phe Lys Lys Gly Ala Phe Asn lie Ala 
180 185 190 



Val Arg Ala Gin He Pro He He Pro Val Val Phe Ser Asp Tyr Arg 
195 200 205 . 



Asp Phe Tyr Ser Lys Pro Gly Arg Tyr Phe Lys Asn Asp Gly Glu Val 
210 215 220 



Val He Arg Val Leu Asp Ala He Pro Thr Lys Gly Leu Thr Leu Asp 
225 230 235 240 



Asp Val Ser Glu Leu Ser Asp Met Cys Arg Asp Val Met Leu Ala Ala 
245 250 255 



Tyr Lys Glu Val Thr Leu Glu Ala Gin Gin Arg Asn Ala Thr Arg Arg 
260 265. 270 



Gly Glu Thr Lys Asp Gly Lys Lys Ser Glu 
275 280 



<210> 37 

<211> 477 

<212> PRT 

<213> Phaeodactylum tricornutum, Physcomitrella patens, Caenorhabditis elegans 

<400> 37 



Met Gly Lys Gly Gly Asp Ala Arg Ala Ser Lys Gly Ser Thr Ala Ala 
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10 



15 



Arg. Lys lie Ser .Trp Gin Glu Val Lys Thr His Ala Ser Pro Glu Asp 
20 25 30 



Ala Trp lie lie His Ser Asn Lys Val Tyr Asp Val Ser Asn Trp His 
.35 40 45' 



Glu His Pro Gly Gly Ala Val lie Phe Thr His Ala Gly Asp Asp Met 
50 55 60 



Thr Asp He Phe Ala Ala Phe His Ala Pro Gly Ser Gin Ser Leu Met 
65 70 75 80 



Lys Lys Phe Tyr He Gly Glu Leu Leu Pro Glu Thr Thr Gly Lys Glu 
85 90 95 



Pro Gin Gin He Ala Phe Glu Lys Gly Tyr Arg Asp Leu Arg Ser Lys 
100 105 110 



Leu lie Met Met Gly Met Phe Lys. Ser Asn Lys Trp Phe Tyr Val Tyr 
115 120 125 



Lys' Cys Leu Ser Asn Met Ala He Trp Ala Ala Ala Cys Ala Leu Val 
130 135 140 



Phe Tyr Ser Asp Arg Phe Trp Val His Leu Ala Ser Ala Val Met Leu 
145 . 150 155 160 



Gly Thr Phe Phe Gin Gin Ser Gly Trp Leu Ala His Asp Phe Leu His 
165 170 175 



His Gin Val Phe Thr Lys Arg Lys His Gly Asp Leu Gly Gly Leu Phe 
180 . 185 190 



Trp Gly Asn Leu Met Gin Gly Tyr Ser Val Gin Trp Trp Lys Asn Lys 
195 200 205 



His Asn Gly His His Ala Val Pro Asn Leu His Cys Ser Ser Ala Val 
210 215 220 



Ala Gin Asp Gly Asp Pro Asp He Asp Thr Met Pro Leu Leu Ala Trp 



121 



225 



230 



235 



240 



Ser Val Gin Gin Ala Gin Ser Tyr Arg Glu Leu Gin Ala Asp Gly Lys 
245 250 255 



Asp Ser Gly Leu Val Lys Phe Met lie Arg Asn Gin Ser Tyr Phe Tyr 
260 265 270 



Phe Pro lie Leu Leu Leu Ala Arg Leu Ser Trp Leu Asn Glu Ser Phe 
275 280 285 



Lys Cys Ala Phe Gly Leu Gly Ala Ala Ser Glu Asn Ala Ala Leu Glu 
290 295 300 



Leu Lys Ala Lys Gly Leu Gin Tyr Pro Leu Leu Glu Lys Ala Gly lie 
305 310 315 320 



Leu Leu His Tyr Ala Trp Met Leu Thr Val Ser Ser Gly Phe Gly Arg 
325 330 335 



Phe Ser Phe Ala Tyr Thr Ala Phe Tyr Phe Leu Thr Ala Thr Ala Ser 
340 345 350 



Cys Gly Phe Leu Leu Ala He Val Phe Gly Leu Gly His Asn Gly Met 
355 360 365 



Ala Thr Tyr Asn Ala Asp Ala Arg Pro Asp Phe Trp Lys Leu Gin Val 
. 370 375 380 



Thr Thr Thr Arg Asn Val Thr Gly Gly His Gly Phe Pro Gin Ala Phe 
385. 390 395 400 



Val Asp Trp Phe Cys Gly Gly Leu Gin Tyr Gin Val Asp His His Leu 
405 410 415 



Phe Pro Ser Leu Pro Arg His Asn Leu Ala Lys Thr His Ala Leu Val 
420 425 430 



Glu Ser Phe Cys Lys Glu rrp Gly Val Gin Tyr His Giu Ala Asp Leu 
435 440 445 



Val Asp Gly Thr Met Glu Val Leu His His Leu Gly Ser Val Ala Gly 
450 455 460 
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Glu Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala .Met 
465 470 . 475 



<210> 


38 


<211> 


47 


<212> 


DNA 


<213>. 


artificial sequence 


<220> 




<221> 


misc__f eature 


<222> 


(1) ,. (47) 


<223> 


synthetic oligonucleotide 


<400> 


38 


ccggaattcg gcgcgccgag ctcctcgagc 


<210> 


39 


<211> 


47 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<221> 


misc_f eature 


<222> 


<1>..(47) 


<223> 


synthetic oligonucleotide 


<400> 


39 


ccggaattcg gcgcgccgag ctcctcgagc 


<210> 


40 


<211> 


47 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<221> 


misc_f eature 


<222> 


(1) - . (47) 


<223> 


synthetic oligonucleotide 


<400> 


40 


ccggaattcg gcgcgccgag ctcctcgagc 


<210> 


41 


<211> 


48 


<212> 


DNA 


<213> 


artificial sequence 



<220> 

<221> mi sc_f eature 
<222> (1)..(48) 
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<223> synthetic oligonucleotide 

<400> 41 

aaaactgcag gcggccgccc accgcggtgg gctggctatg aagaaatt 

<210> 42 

<211> 27 

<212> DNA 

<213> artificial sequence 
<220> 

<221> misc_f eature 

<222> (1) . . (27) 

<223> synthetic oligonucleotide 

<400> 42 

cgcggatccg ctggctatga agaaatt 



<210> 43 

<211> 45 

<212> DNA 

<213> artificial sequence 
<220> 

< 2 2 1 > mi s c_f eature 

<222> (1) . . (45) 

<223> synthetic oligonucleotide 

<400> 43 

tcccccggga tcgatgccgg cagatctgct ggctatgaag aaatt 



<210> 44 

<211> 40 

<2.12> DNA 

<213> artificial 



sequence 



<220> 

<221> misc_feature 
<222> (1) . . (40) 

<223> synthetic oligonucleotide 
<400> 44 

aaaactgcag tctagaaggc ctcctgcttt aatgagatat 



<210> 45 

<211> 51 

<212> DNA 

<213> artificial sequence 
<220> 

<221> mi sc_f eature 

<222> (1)..(51) 
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<223> synthetic oligonucleotide 
<400> 45 

cgcggatccg atatcgggcc cgctagcgtt aaccctgctt taatgagata t 



<210> 


46 


<211> 


33 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<221> 


misc_f eature 


<222> 


(I). .(33) 


<223> 


synthetic oligonucleotide 


<400> 


46 


tcccccgggc catggcctgc tttaatgaga 


<210> 


47 


<211> 


53 


<212> 


DNA . 


<213> 


artificial sequence 


<220> 




<221> 


misc_feature 


<222> 


(1)..(53) 


<223> 


synthetic oligonucleotide 


<400> 


47' 



cccaagcttg gcgcgccgag ctcgaattcg tcgacggaca atcagtaaat tga 



<210> 48. 

<211> 53 

<212> DNA. 

<213> artificial sequence 
<220> 

<221> misc_f eature 

<222> (1)..(53) 

<223> synthetic oligonucleotide 

<400> 48 

cccaagcttg gcgcgccgag ctcgaattcg tcgacggaca atcagtaaat tga 



<210> 49 

<211> 47 

<212> DNA 

<213> artificial sequence 
<220> 

<221> misc_f eature 

<222> (1)..(47) 

<223> synthetic oligonucleotide 
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<400> 49 

cccaagcttg gcgcgccgag ctcgtcgacg gacaatcagt aaattga 



<210> 


50 


<211> 


29 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<221> 


misc_f eature 


<222> 


(1) . . (29) 


<223> 


synthetic oligonucleotide 


<400> 


50 


acataatgga gaacttctgg tcgatcgtc 


<210> 


51 


<211> 


24 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<221> 


mi sc_f eature 


<222> 


(1) . . (24) 


<223> 


synthetic oligonucleotide 


<400> 


51 


ttactcagat ttcttcccgt cttt 


<210> 


52 


<211> 


26 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<221> 


mi sc_f eature 


<222> 


(1)..(26) 


<223> 


synthetic oligonucleotide 


<400> 


52 


acataatgac cttcctagcc atatta 


<210> 


53 


<211> 


24 


<212> 


DNA 


<213> 


artificial sequence 



<220> 

<221> misc_f eature 

<222> (1)..(24) 

<223> synthetic oligonucleotide 
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<400> 


53 


tcagatattc aaattggcgg cttc 


<210> 


54 


<211> 


32 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<221> 


misc_f eature 


<222> 


. (1) .. .(32) 


<223> 


synthetic oligonucleotide 


<400> 


54 


ttaagcgcgg ccgcatggag aacttctggt 


<210> 


55 


<211> 


31. . 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<221> 


misc_f eature 


<222> 


(1)..(31) 


<223> 


synthetic oligonucleotide 


<400> 


55 


acctcggcgg ccgccctttt actcagattt 


<210> 


56 


<211> 


41 


<212> 


DNA 


<213> 


artificial sequence 


<220> 




<221> 


misc_f eature 


<222> 


(1) (41) 


<223> 


synthetic oligonucleotide 


<400> 


56 



acataatgga gaacttctgg tctattgttg tgttttttct 



<210> 57 

<211> 41 

<212> DNA 

<213> artificial sequence 



<220> 

<221> misc_f eature 

<222> (1)..(41) 

<223> synthetic oligonucleotide 



<400> 57 
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ctagctagct tactcagatt ' tcttcccgtc ttttgtttct c 



41 



